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ABSTRACT 


We use the dist ance probability density function (DPDF) formalism of Ellsworth-Bowers 


et al. (2013 2015) to derive physical properties for the collection of 1,710 Bolocam Galactic 


Plane survey (BGPS) version 2 sources with well-constrained distance estimates. To account 
for Malmquist bias, we estimate that the present sample of BGPS sources is 90% complete 
above 400 and 50% complete above 70 . The mass distributions for the entire sample 

and astrophysically motivated subsets are generally fitted well by a lognormal function, 
with approximately power-law distributions at high mass. Power-law behavior emerges 
more clearly when the sample population is narrowed in heliocentric distance (power-law 
index a = 2.0 ± 0.1 for sources nearer than 6.5 kpc and a = 1.9 ± 0.1 for objects between 
2kpc and 10kpc). The high-mass power-law indices are generally 1.85 < a < 2.05 for 
various subsamples of sources, intermediate between that of giant molecular clouds and 
the stellar initial mass function. The fit to the entire sample yields a high-mass power-law 
a = 1.941q‘^q. Physical properties of BGPS sources are consistent with large molecular cloud 
clumps or small molecular clouds, but the fractal nature of the dense interstellar medium 
makes difficult the mapping of observational categories to the dominant physical processes 
driving the observed structure. The face-on map of the Galactic disk’s mass surface density 
based on BGPS dense molecular cloud structures reveals the high-mass star-forming regions 
W43, W49, and W51 as prominent mass concentrations in the first quadrant. Furthermore, 
we present a 0.25-kpc resolution map of the dense gas mass fraction across the Galactic disk 
that peaks around 5%. 

Subject headings: Galaxy: structure - ISM: clouds - stars: formation - submillimeter: ISM 


1. INTRODUGTION 

As the possible precursors to stellar clusters, OB 
associations, or smaller stellar groups, molecular 
cloud clumps and cores have become a primary 
focus for understand ing the process of high- mass 


star formation (e.^., McKee & Ostriker 2007 and 
references therein). The recent advent of large- 
scale continuum surveys of the Gal actic plane at 


20111 [Ginsburg et al.j 

poTsl 

ATLASGVC: 

et al.|2009t Hi-GAL, Molinari et al.|2010b) 


! de- 


dust emission. A major payoff of blind surveys in 
this portion of the electromagnetic spectrum is the 
derivation of physical properties of regions hosting 
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high-mass star formation. A detailed census of these 
dense molecular cloud structures can help constrain 


Kennicutt & Evans 20121). Studies of the physical 

properties (e.q., Feretto & Fuller 2010 

Giannetti 

et al. 

20131) and mass distributions ( e.g., 

INetterfield 

et al. 

2009 |(Jlmi et al.||2013| |G6mez et al.||2014p of 


these objects have begun, but despite the richness 
of the current data sets, a coherent picture has not 
yet emerged for the evolution of the dense interstel¬ 
lar medium and the origin of the stellar initial mass 
and stellar cluster mass functions. 

While giant molecular clouds (GMGs) have been 
studied for several decades using the lowest rota¬ 
tional transitions of the isoto pologues of CO and 
other simple molecules (e.q., Scoville fc Solomon 
1975| [Cohen et al.||198Q| [Dame et al.j I987[ |2UU1[ 
and references therein), and studies of the appar¬ 
ently unifo rm stellar and clust er initial m ass func¬ 
tions { e.q., [Bastian et al.|2010) go back tojSalpeter 
( 1955|), the observational technology (detectors and 
angular resolution) has only recently developed to 
study the dense substructures intermediate between 
these two extremes. And while studies of nearby 
molecular cloud complexes have recentl y yielded es- 


Motte et al. 


timates of the core mass function ( e.q., _ 

2007 Enoch et al. 2007 Swift & Beaumont|20I0|) for 

the possible progenitors of single stellar systems, it 

has only been with the recent large-scale blind sur¬ 
veys that the study of the clump mass function has 
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been possible. 

Theoretical modeling of molecular cloud structure 
evolution is beginning to pla ce constraints on the 
clump mass function {e.g.^ Donkov et ah |2012l 
Veltchev et al. 2Q13| ), and two primary tunc 


forms are discussed in the literature: the power-law 
and lognormal distributions. Both arise from phys¬ 
ical processes in molecular clouds; supersonic tur¬ 
bulence within molecular cloud s produces a lognor- 


mal dens ity distribution {e .g., Padoan et al.||1997 


although [Tassis et al.|2QlQ| find other means to pro¬ 
duce a lognormal), while gravitational collapse of 
dense structures tends to pr oduce a power law {e.g.^ 
Padoan & Nordlund||2002|). The distinctions be- 
tween these have implications fo r competing theo - 
ries of high-mass star formation (|Elmegreen||l985|) . 
Eurthermore, the observed mass mnction for dense 
molecular cloud structures should resemble a combi¬ 
nation of these forms due to the complex interac tion 
of physical processes (e.^., Offner et al. 2014); for 


ipOry 

distribr 


a discussion of the details, see Hopkins 
In a broader context, the Galactic distribution 
of dense molecular gas and star formation has 
implications for using the Milky Way as ground 
truth for studies of extragalactic star formation 
and galaxy formation. Cosmological simulations of 
galaxy formation and evolution have made remark¬ 
able progress in the last two decades reproducing 
some of the observed properties of galaxies (e.g., 
Katz et ar||1996| |Kerei et al.||20Q^ |Stinson et al.| 
2013), but numerical simulations have largely been 
unable to a priori produce the strong winds and in¬ 
stantaneous ineffiviency_ofst^formation observed 
in galaxies ( Hopkins et al.|[2M4 ). By incorporating 
feedba ck from stars via 'sub-grid’ physics recipes 
(e.^., Kim et al. 2014), numerical galaxy evolu¬ 
tions niodels can now reliably produce s piral galax¬ 


ies. Very recently, Hopkins et al. (2014) introduced 
simulations that do include realistic feedback to pro¬ 
duce galaxies commensurate with locally observed 
examples. The continued refinement of these mod¬ 
els must rely upon the constraints of derived phys¬ 
ical properties of dense molecular cloud structures 
in the Milky Way. Additionally, Galactic structures 
can provide a comparison for scaling relationships 
for star formation rates with respe ct to more active 
galaxies ( Kamenetzky et aL]|2Q14 ). 

As the first completed dust-continuum millimeter- 
wavelength survey of the northern Galactic plane, 
the Bolocam Galactic Plane Survey (BGPS) has 
provided a foundation for studying dense molec¬ 
ular cloud structures in a variety of Galactic en¬ 
vironments. The majority of objects detected in 
the BGPS correspon d to molecular cloud clumps 
( Dunham et al.||2Qll , hereafter Dll), each of which 
will likely form a cluster of starsDerivation of 
physical properties of these objects requires r obust 


distance estimates; 


Ellsworth-Bowers et al 


(2013 


Molecular cloud clumps are typically fO^ — fO^ Mq with 
sizes 0.3 — 3 pc (lBe^i^_&_^^fel^[200^ . Following Dff, we 
generally adopt tJie |.bergin ^ 'l'atalla| classifications for dense 
molecular cloud structures. 


2015|) developed a distance probability density func¬ 
tion (DPDE) methodology for utilizing all available 
information towards this goal. By combining kine¬ 
matic distance information with prior DPDEs based 
on ancillary data (such as Spitzer/GLIMPSE mid- 
infrared images and trigonometric par allax mea¬ 
surements) in a Ba yesian framework, [Ellsworth 


Bowers et al.j (|2015| hereafter EB15) presented a 
catalog of 1,710 BGPS objects with well-constrained 
distance estimates. 

In this work, we build on that distance catalog 
to derive the physical properties of and fit mass 
functions to this vital population of sources. The 
probabilistic description of the distance to each ob¬ 
ject in the survey catalog may be used to directly 
propagate uncertainties into the derived physical 
properties via Monte Carlo methods. We present 
here a catalog of physical properties for this set of 
sources, and fit mass functions to the entire sample, 
as well as astrophysically motivated subsets. F^ur- 
t her more, we explore the extension o f fundamental 
molecular cloud scaling relationships (|Larson||1981 ) 


to dense substructures and map the distribution of 
dense molecular gas throughout the Galactic disk. 

2. DATA 

The Bol ocam Galactic Plane Survey version 2 


(BGPS V2; Aguirre et al. 2011 Ginsburg et al. 2013 
hereafter G13), is a A = 1.1 mm continuum sur¬ 
vey covering 192 deg^ at 33" resolution. It is one 
of the first large-scale blind surveys of the Galac¬ 
tic plane in this region of the spectrum, covering 
— 10° < i < 90° with |6| < 0?5 and selected larger 
cross-cuts to |6| < 1?5, plus selected regions in the 
outer Galaxy. Eor a map of BGPS V2 coverage and 
details about observation methods and the data re¬ 
duction pipeline, see G13. Erom the BGPS V2 im¬ 
ages, 8,594 millimeter dust-continuum sources were 
identified using a custom extraction pipeline. BGPS 
V2 pipeline products, including ii nag e mosaics and 
the catalog, are publicly available 

The BGPS data pipeline removes atmospheric 
signal using a principle component analysis tech¬ 
nique that discards common-mode time-stream sig¬ 
nals correlated among bolometers in the focal plane 
array. The pipeline attempts to iteratively identify 
astrophysical signal and prevent that signal from 
being discarded with the atmospheric signal. This 
sky-subtraction behaves like a high-pass filter that 
may be characterized by an angular transfer func¬ 
tion passing scales from approximately 33" to 6' (see 
G13 for a full discussion). The effective angular 
size range of detected BGPS sources therefore cor¬ 
responds to anything from molecular cloud cores up 
to entire GMCs depending on heliocentric distance 

(Dm ___, 

[Ellsworth-Bowers et al. (2015) presented a dis- 
tance catalog for BGPS V2 sources using a Bayesian 
DPDE framework. The DPDEs describe the rela¬ 
tive probability of finding an object at points along 

Available through IPAC at 
http://irsa.ipac.caltech.edu/data/B0L0CAM_GPS 
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the line of sight and are constructed from multiple 
sources of distance-related information via 

DPDF(4) = £Ksr,^,&;4) (1) 

i 

where C{vi,sR^l,b] d^) is the kinematic distance 
likelihood function based on a line-of-sight velocity 
('^lsr)^ are prior DPDFs based 

on ancillary data. Given circular orbits about the 
Galactic center, there are two points along a line of 
sight looking interior to the Solar circle that corre¬ 
spond to a single the kinematic distance am¬ 

biguity (KDA). The jC{vi^sR^^^b; d^) for such ob¬ 
jects is double-peaked, representing the equal prob¬ 
ability of finding the object at either kinematic 
distance; prior DPDFs are used principally to re¬ 
solve the KDA, placing an object at either the 
near or far distance (see FBI5 for further discus¬ 
sion). The resulting catalog contains 1,710 sources 
with well-constrained distance estimates (distance 
uncertainties generally < 0.5 kpc), hereafter re¬ 
ferred to as the Distance Gatalog. The posterior 
DPDF^^are normalized to unit integral probability 
(Jq DPDF dd^ = 1) to facilitate marginalization 
over distance, and allow for the Monte-Carlo prop¬ 
agation of uncertainty in an object’s distance to the 
derived physical properties by randomly sampling 
the DPDFs many times. 


3. METHOD 

3.1. Computing Physical Properties Using the 
DPDF 


It is straightforward to compute physical proper¬ 
ties of dense molecular cloud structures using single¬ 
value distance estimators. The complete informa¬ 
tion contained in an object’s posterior DPDF, how¬ 
ever, represents the combination of knowledge and 
ignorance about its distance based on a wide col¬ 
lection of data encapsulated in the suite of prior 
DPDFs applied. Use of this full information, there¬ 
fore, requires careful consideration. Below, we dis¬ 
cuss mass c alculation for BGPS objects in some de¬ 
tail (^3.1.1), and then apply those metho ds to t he 
computation of other physical properties ( p.1.2 ). 


3.1.1. Mass Derivation 

A simple estimate of a dense molecular cloud 
structure’s mass may be computed from optically 
thin millimeter dust continuum data via 


M = 


r dl 

Ky By{Td) 


( 2 ) 


where Sy^int is the source-integrated flux density, 
d^ is the estimated heliocentric distance, Ky is the 
opacity per mass of dust, r = {Tn^/m^) is the gas-to- 
dust mass ratio, and By{Td) is the Planck function 
evaluated at dust temperature Td. For the specific 


Available with the BGPS release at IPAC. 


case of the BGPS, 


M = 13.1 


13.0 K/20.0 K _ 1 y Jy y 

( 3 ) 

where S-^ ^ is the A = 1.1 mm source-i ntegrated flux 
density, tiy = 1.14 cm^ g“^ of dust ( Ossenkopf fc 



id = zujx. t:3ranaara error-propagation 
may be used to determine the mass uncertainty, but 
the asymmetric nature of many DPDF error bars 
(EB15) complicates the issue. The uncertainty in 
the distance may be marginalized over to produce 
the expectation value of the mass via 


(M) = 


r5, 


u,int 


Ku B^Td) 


f 


DPDF dl dd^ , 


( 4 ) 


where (M) is proportional to the second moment of 
the DPDF. The bimodal nature of many posterior 
DPDFs in the inner Galaxy, however, causes this to 
be a biased estimator of the mass, in much the same 
way d (the first-moment distance) can be a biased 
estimator of the distance. For instance, an object 
with a well-constrained distance estimate may have 
up to ^ 20% of the integrated DPDF in the non- 
favored kinematic distance peak; the difference be¬ 
tween d and the maximum-likelihood distance {d^^) 
can be gre ater than 1 kpc, or > 2x the u ncertainty 
in Ellsworth-Bowers et al.||2Q13 for a com¬ 

plete discussion). 

This bias, coupled with the measured source flux- 
density and assumed dust temperature uncertain¬ 
ties, suggests using a Monte Garlo approach for 
estimating an object’s mass and associated uncer¬ 
tainty. To determine the maximum-likelihood mass 
for a given catalog object. Equation Q is com¬ 
puted many times with each realization^rawing a 
distance, flux density, and temperature from the 
suitable probability density functions. The result¬ 
ing (properly normalized) mass probability density 
function (MPDF) may then be used to estimate the 
maximum-likelihood mass {M^^) and the 68.3% er¬ 
ror bars in the same fashion as done with DPDFs 
in EB15. 

Being the probability of an object having a mass 
between M and M + dM, the MPDF of a given 
object is created from the histogram of masses pro¬ 
duced by maw (^ 10^) Monte Carlo realizations 
of Equation Q. Given the fractional nature of the 
uncertainties m the derived mass, we chose to use 
logarithmic binning for computation of MPDFs and 
the probability density functions for other physical 
properties. Note that the computation of DPDFs 
(EB15) is done with respect to a fixed linear dis- 


This table of Ky represents dust grains with ice mantles, 
coagulating in cold, dense molecular regions at n = 10^ cm“^ 
for 10® yr. This is appropriate for molecular cloud co res, and 
may be an upper limit for less-dense clumps {e.g.^ [Martin] 
jet al.|2012|). The computed masses may, therefore, be biased 
low, b ee a recent study of dust opacity for dense molecular 
gas by jJuvela et al.j ( |2015| > . 
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tance scale and is not subject to the need to create 
histograms from Monte Carlo calculations. 


3.1.2. Physical Radius and Number Density 

In addition to the masses of dense molecular cloud 
structures, the catalog of heliocentric distances en¬ 
ables the computation of physical sizes and num¬ 
ber densities of these objects. Not only can these 
physical properties help to disentangle the various 
populations of objects (z.e., cloud, clump, core) de¬ 
tected by the BGPS and others, but they can also 
test the extension of various empirical relationships 
for GMCs down to more dense substructures. 

The physical radius is computed from 

i? = 0.29pc , (5) 

yarcmmy ykpcy 


where Or is the deconvolved angular radius of a de¬ 
tected object computed from geometric mean of the 
deconvolved major (cTmaj) and minor (cTmin) axes of 
the flux density distribution. 


Or = 11 [(c^maj - ^bm) (^^min “ ^^bm)] 


1/4 


(6) 


rms 


(Rosolowsky et al. 2 Q 1 Q[ ). For the BGPS, the 

size of the beam is given by (Jbm = ^fwhm / V81n2 = 
14", and 77 = 2.4 is a factor relating the rms size 
of the emission distribution to the angular radius 
of the object PI Objects whose measured minor 
axis dispersion is smaller than (Jbm bo not have a 
real solution to Equation 0; actual physical ob¬ 
jects should not fall into this category, but effects 
of the cataloging process lead to 29% of the full 
BGPS V 2.1 catalog and 20 % of the Distance Gata- 
log having a complex computed Or. These objects 
are excluded from the analyses of physical radius 
and number density in Q 

As a means for classifying dense molecular cloud 
structures, the mean number density offers possible 
insight into the physical processes, such as structure 
formation via turbulence or gravitational collapse, 
that may be at play in a given catalog object. The 
number density computed with assumed cylindrical 
geometry (z.e., a circle on the sky with radius R, 
and depth i?)p]n = M/TTiirriHR^^ may be param¬ 
eterized in terms of mass and radius or observable 


The appropriate value of 77 to be used depends on the 
true emission distribution of the object and its size relative 
to the beam. |Rosolowsky et al.| (|2010|) computed 77 for a 
large range of models, varying tne emissivity distribution, 
angular size relative to the beam, and signal-to-noise ratio. 
The chosen value for the BGPS catalog is the median value 
from these simulations, but the variations span more than a 
factor of two. Use of a specific source model in conjunction 
with the BGPS catalog would require the appropriate value 
of 77 and rescaled catalog values for Or 

This parameterization of the volume results in U = ttR^; 
assuming spherical geometry increases the volume by a factor 
of 4/3, and assuming a c ylind er of depth 2R increases it by a 
factor of 2 . As shown in §4.1| the number density is a poorly 
constrained quantity for these objects, and source-to-source 
geometry differences play a significant role. 


quantities as 

n = 4.60 X 10^ cm“^ 

= 2.47 X 10 ^ cm -3 

fAij ( 

V Jy / 


M 


100 M 


o 

13.0 K/Td 


gl3.0 K/20.0 
-3 


(N 

Vkpc 



X 

( 7 ) 


The first parameterization relates the physical 
quantities in units appropriate to dense molecular 
cloud structures, while the second is specific to the 
observational quantities of the BGPS and the as¬ 
sumptions used in Equation Note that the 

number density is not as strong a function of he¬ 
liocentric distance as is the mass; incorrect KDA 
resolution will have a smaller effect on the derived 
value. 


3.2. Constructing Mass Distributions and 
Estimating Completeness 

The mass function describing the relative num¬ 
bers of dense molecular cloud structures is of partic¬ 
ular interest for connecting star-formation theory to 
observation. Functional forms are fitted to the dis¬ 
tribution of masses and we explore some of the pos¬ 
sible mass distribution constructions. Additionally, 
since the range of masses over which conclusions 
may be drawn is restricted by completeness consid¬ 
erations, we present an analysis estimating the mass 
completeness levels for a heterogenous data set such 
as the BGPS. 


3.2.1. Mass Distributions 

In its simplest form, the mass distribution of 
dense molecular cloud structures may be compiled 
from the direct application of Equation <§ to each 
object with a well-constrained distance estimate, 
where is either the maximum-likelihood dis- 
tance or the first-moment dis tance, as discussed in 
Ellsworth-Bowers et al. (2013). This form of the dis- 
tribution lor the Distance Catal^ sources is shown 
as the red histogram in Figure fp {left) and repre¬ 
sents a naiVe, but effective, realization of the mass 
distribution. 

While the simple mass distribution utilizes the 
best distance estimate from the DPDF formalism, 
it discards much of the information contained in the 
posterior DPDFs. Given the asymmetric nature of 
many DPDFs, the most straightforward way to cap¬ 
italize on this pool of information is through Monte 
Garlo realizations utilizing the MPDFs. Plotted 
in black in Figure {left) is the mass distribu¬ 
tion computed using me from the Monte-Garlo 
generated MPDFs. While there are noticeable de¬ 
viations in the wings of the distributions, a two- 
sample KS test between these two distributions re¬ 
veals that they are indistinguishable to better than 
99.9%. Despite the negligible differences between 
the distributions, the power of the MPDF-based 
mass distributions lies in the ability to create many 



















BGPS XIIL Physical Properties 


5 



Figure 1. Comparison of mass computation methods. Left: The mass distribution computed b y applying Equation ([^ to each 
object, using the appropriate single distance estimate from the posterior DPDF (as discussed in |Ellsworth-Bowers et 
‘Simple Mgim’, red) plotted atop the mass distribution using the maximum-likelihood masses trom the MPDF s ClVLaximum- 
Likelihood black). Also shown is the Monte Carlo composite mass distribution, as discussed in ^4.2. 1| (hashed cyan). 

Middle: Comparison of Mj^^with Msim for each object in the Distance Catalog. The red dashed line marks the 1:1 relationship. 
Right: Histogram of the ratio M^^/Mgirn for each object in the Distance Catalog, with Caussian fit overlaid in red. 


Monte Carlo realizations of the mass distribution, 
randomly drawing a mass from each object’s MPDF 
for each trial. 

If the aggregate mass distributions between the 
two methods are indistinguishable, what of the 
masses of individual objects? The comparison of 
the mass derived from a simple application of Equa¬ 
tion using the single-value distance estimate 
from EB15 (Msim) with the maximum-likelihood 
value from the MPDF is shown in the middle 

panel of Figure where the dashed line marks the 
1:1 relationship. The histogram of the ratio of these 
masses is plotted in the right panel 

along with a Gaussian fit. The mean of the dis¬ 
tribution is consistent with unity, and has a stan¬ 
dard deviation of « 2%. Differences between the 
masses are more significant for low-mass objects 
(M < 100 Mq), with Mj^L^^coming larger than the 
simple mass by up to 60% in the most extreme case. 

Going beyond constructing a mass distribution for 
the Distance Catalog using a single mass for each 
object, a more complete Monte Carlo approach is 
shown as the hashed cyan histogram in Figure 
(left). Rather than taking the maximum-likelihood 
value from the MPDF of each object (as does the 
black histogram in that panel), each object’s MPDF 
was randomly sampled 20,000 times to produce a 
mass distribution containing 34.2 million entries. 
The resulting histogram was then normalized to rep¬ 
resent the N = 1710 objects of the Distance Cat¬ 
alog. The resulting distribution is much smoother 
at both mass extrema because of the lack of small- 
number statistics. Interestingly, the mode (peak) of 
the distribution is shifted to slightly higher mass, 
likely due to the dependence in Equation (^. 
This Monte Carl o com posite distribution is dis¬ 
cussed further in §4.2. 1[ 

3.2.2. Mass Completeness Function 

(Sub-)Millimeter continuum surveys of thermal 
dust emission are fiux-limited, so Malmquist bias 
must be addressed when analyzing the mass distri¬ 



Figure 2. Computed mass completeness levels. Cumulative 
distribution of the mass corresponding to 5x rms flux-density 
noise and the source distance from the posterior DPDF. The 
cyan curve represents the ‘Simple’ distribution, using the 
appropriate single distance e stimator from the DPDF (see 
|Ellsworth-Bowers et al.|2013| >. The black curve is the aggre- 
gate of 10'^ Monte Carlo realizations, with each realization 
randomly drawing a distance from each object’s posterior 
DPDF and dust tempe rature from the lognormal distribu¬ 
tion discussed in §4.1.1| Dotted horizontal lines identify 50% 
and 90% completeness. Light vertical dashed and dot-dashed 
lines identify the 50% completeness levels for both distribu¬ 
tions, and the dark vertical lines mark the 90% completeness 
levels. 

bution of detected objects. It is relatively straight¬ 
forward to identify fiux-density completeness levels, 
but since higher mass (and therefore brighter) ob¬ 
jects are detectable through a larger Galactic vol¬ 
ume than low-mass objects, it is imperative to define 
the mass criteria at which a survey is complete to 
some level. While small s urveys of isolated regions 


at known distances (e.q.. 

Ridge et al.||2006 

|Enoch 

et al. 2006 

Rosolowsky et al.|2008) can direct 

4y esti- 


mate strict mass completeness levels, heterogenous 
surveys of the entire Galactic plane must be content 
to have a more complex criterion. 


We cast the issue of completeness levels in terms 
of a mass completeness function^ which describes 
the fraction of objects in the survey area at any 
given mass that should be detected. Mass complete- 
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ness begins with flux-density completeness, usually 
defined in terms of the rms noise in survey maps. 
For instance, the BGPS VI catalog is 99% complete 
at 5x the rms noise flux density for any given field 
(Rosolowsky et al.||2010), a value that is unchanged 
in version 2 (G13). While surveys that do not have 
uniform noise properties across the Galactic plane 
pose an additional complication to the process, vari¬ 
able backgrounds throughout the Galactic plane (es¬ 
pecially at shorter wavelengths) will also affect the 
completeness calculation. The flux-density com¬ 
pleteness level must, therefore, be estimated on a 
field-by-fielq^ (or, better, source-by-source) basis. 
Estimates oirms noise in survey images, however, 
usually do not take the confusion limit into account, 
so these completeness levels should be viewed as 
lower limits. 

To begin, we compute the local flux-density com¬ 
pleteness level for each catalog object as 5x the 
mean of the BGPS n oise maps within the ob jecPs 
catalog boundary (see|Rosolowsky et al.|2010|). The 
minimum complete flux density tor each object im¬ 
plies that any source at this location in the sur¬ 
vey map brighter than this value will be detected 
> 99% of the time. Approximately 13% of the 
objects in the Distance Gatalog have flux densities 
below their own completeness value, meaning that 
they are < 5cr detections. These objects represent 
a tip of the iceberg; similar objects exist in those 
same fields but were not recovered. 

Gombining the completeness flux density with 
the object’s distance information allows an esti¬ 
mate of the “minimum complete mass” for each 
object, meaning that any object at the specified 
(^, ) with mass higher than this value will be 


detected > 99% of the time. Repeating this process 
on a source-by-source basis allows for the effective 
marginalization over the variable noise properties 
across the Galactic plane. The mass completeness 
function is therefore the cumulative distribution of 
the minimum complete mass for all objects in the 
sample. 

Figure illustrates the mass completeness func¬ 
tions computed using the two techniques described 
in Figure The completeness function for the sin¬ 
gle distance estimate from EB15 for each object is 
plotted in cyan, and an aggregation of 10^ Monte 
Garlo realizations pulling distances from each ob¬ 
ject’s DPDF is shown in black. The curves are sim¬ 
ilar, but a two-sample KS test reveals they are not 
drawn from the same population at greater than 
99.5% significance. Shown as horizontal dotted lines 
are the 50% and 90% completeness levels, and the 
vertical lines mark the intersections of the curves 
with these levels. The 50% completeness level is 

70 Mq for both curves, and the 90% completeness 
level is ~ 400 . The importance of these values 

is in limiting conclusions about the mass distribu¬ 
tions (or other derived physical properties) to larger 
masses; extending conclusions to smaller-mass ob- 


The BGPS has a variable rms noise from field to field 
(see G13, their Figures 1 and 2). 


jects is not supported by the available data. 


3.3. Fitting a Functional Form to the Mass 
Distribution 

To compare the observed mass distribution for 
dense molecular cloud structures with theory and 
other observations, we fit both power law and log¬ 
normal functions to the data. The formulae and 
procedures for this fitting are discussed below, and 
are based on maximum-likelihood methods to avoid 
the pitfalls of working with binned data. 


3.3.1. Power Law 

The use of power-law functions for describing the 
mass distributions of stars and their precursors goes 
back to the original studies o f the stellar initial mass 
function by Salpeter (1955). The mass function is 
defined as the number of objects either per logarith¬ 
mic mass interval. 


C(logM) = 


dN 

dlogM 


or per linear mass interval, 

'' ' dM M In 10 


(X M 


1 


(8) 


dN 


Min 10/ dlogM 


19 ) 


where ^(M) oc M“^, and x = a —1 (|Ghabrier|2QQ3D . 
Within this framework, the |balpeter| power- law in- 
dex is X = 1.35, a = 2.35. For a detailed description 
of this parameterization and its consequence s, see 
Swift & Beaumont| (|2010|) , |01mi et al.| (|2013|) , and 
references therein. T'he mathematical nature of a 
power law requires a finite lower limit on the mass 
range over which it is a valid descriptor of the data 
in addition to the observational constraint that the 
mass distribution turns over at M < 1^© 
Kroupa||2001|). 

T'o fit a power-law function to the observed 
mass distribution, w e use the max i mum- likelihood 
method described by Glauset et al. (2009|^^ to esti¬ 
mate the power-law index and range over ^ich it is 
valid. Briefly, assuming a > 1 and M^ax ^ Afmin, 
the probability (or likelihood) that the observed 
masses (M^) are drawn from a power-law distribu¬ 
tion is proportional to 


Ppi (M I a) 


n 


a — 1 


M„ 



( 10 ) 


where a is defined as in Equation The 

maximum-likelihood value is computed oy taking 
the derivative of the logarithm of the likelihood 
(£l^with respect to a and finding the root (z.e., 
dC/oa = 0). The power-law index estimator is 


a = 1 n 


E 


Mi 

Mmin 


( 11 ) 


http: //tuvalu. santaf e. edu/~aaronc/powerlaws/ 

The log-likelihood is used for mathematical expediency, 
as the product in Equation M becomes a sum. While the 
numerical value of the function is changed by taking the log¬ 
arithm, the location of the maximum is not. 
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The minimum mass (Mmin) for which a power law 
is a good descriptor of the mass distribution is 
computed from the data. For each value in the 
mass distribution, Equation ( pT] ) is computed using 
that mass as Mmin- The cuumlative distribution of 
M > Mmin is compared with the analytic power law 
with index d, and the maximum difference between 
them is computed (ie., the K-S test T)-statistic); 
the value of Mmin that minimizes D is returned by 
the algorithm (see |Clauset et al.|2QQ9 j 
algorithm), 'lue res 


for a complete 


description of the algorithm). due results presented 
here were computed us ing a pyth o n tra nslation of 
the PLFIT routine from IClanset et allPI 

3.3.2. Lognormal 

Predicated on the properties of supersonic turbu¬ 
lence and the turnover in the IMF at M < 1 , 

the lognormal form of the mass function is also com¬ 
monly fit to the mass distr ibutions of dense molecu¬ 
lar cloud structu res (e.^., Swift & Beaumont||2010i 
Olmi et al.|2Q13 ). Analogous to the power-law case, 


we use a maximum-likelihood method for comput¬ 
ing the best fit to the mass distribution. The like¬ 
lihood of the observed masses being drawn from a 
lognormal distribution is proportional to 


Pin (M I/i, ct) = F[ ^exp 


i=l 


M 


(lnM-/r)2 


2cr2 


=n . 


i=l 


( 12 ) 


where the normalization factor 


C\n — xj 2 [6^fo(^min) 6I’fo(^max)] : 


and erfc(x) is the complementary error function 
evaluated at x. The parameters [/i, a] are the mean 
and width of the lognormal Gaussian, respectively, 
and [xmin, ^max] are related to the minimum and 
maximum masses, respectively, over which the log¬ 
normal fit is valid. The maximum-likelihood es¬ 
timators [/i, a] are found by numerically solving 
V^,cr = 0 (z.e., numerically maximizing £), where 

£ = In Pin (M I p, cr) 

n n 

= n In On - . (14) 


i=l 


i=l 


^min? AfjxiaxJ 


As with the power-law case, the limits [M^ 
over which the lognormal fit best describe the data 
must be estimated from the data themselves. We 
employ the same K-S test T)-statistic analysis as de¬ 
scribed above, but use a numerical minimizer to ex¬ 
plore the 2-dimensional parameter space including 
Mmax- Following the algorithmic structure of PLFIT 
(^3.3.1), we wrote a routine in python to maximize 
Equation (14) using Powell’s Method inside a nu¬ 


merical minimization of the K-S test T)-statistic us¬ 
ing a downhill simplex optimization!^ 

4. RESULTS 

4.1. Physical Properties of BGPS Sources 
4.1.1. Catalog 

We present a physical properties catalog for the 
1,710 sources in the Distance Gatalog. The mass, 
physical radius, and number density for each object 
were computed from the Monte- Gar lo probability 
density functions as described in §3.1[ where a dust 
temperature was drawn for each realization from 
a lognormal distribution with a m ean of 20 K and 


full-w idth at half-maximum of 8K (Battersby et al. 
2011). This choice of temperature distribution was 


https://github.com/keflavich/plfit 


recently affirmed by the analysis of analogous dense 
molecular cloud structures fro m the ATLASGAL 
survey by Wienen et al. (|2015). We note that the 
study of IVlerello et al.|(|2015|) combining A = 1.1 mm 
and A = 350 /im data hnds a lower mean tempera¬ 
ture for BGPS sources {Td = 13.3 K for dust spec¬ 
tral index [3 = 2.0, and = 16.3K for [3 = 1.7). 
Using a mean Tg = 2t)K would lead to an under¬ 
estimate of the mass of each object by a factor of 
1. 3 to 1.8 for the S = 1 .7 and P = 2.0, respectively, 
of Merello et al. (2015). While the computed mass 
and number density values are affected by this un¬ 
certainty in temperature, Monte Carlo simulations 
of BGPS so urce mass distributions by |Schlingman| 
et al.| (|201 1|) show ed the power-law slope of the mass 
function (^4.2.2) is not very sensitive to the choice 
of underlying dust temperature distribution. 

The pdfs for mass, physical radius, and number 
density were constructed in log space to capture the 
spread in values over orders of magnitude, and er¬ 
ror bars were computed to enclose 68.3% of the to¬ 
tal probability, where the endpoints occur at equal 
probability (see the discussion of DPDF error bars 
in EB15). The error bars approximate the Gaussian 
± 1(7 region, but account for the asymmetric nature 
of the pdfs. The catalog of physical properties is 
shown in Table along with relevant information 
from the BGPS V2.1 catalog (G13) and heliocentric 
distance (EB15). 

Of particular note in the catalog are the 20% of 
sources in the BGPS Distance Gatalog whose angu¬ 
lar extents on the sky do not yield real solutions to 
Equation ^ for Or. A brief analysis of these objects 
in the context of the flux-density distribution for 
all catalog sources reveals a strong trend whereby 
objects with non-physical Or are preferentially dim 
(Figure!^. The gray and blue histograms represent 
the “unresolved” sources in the full V2.1 catalog and 
the Distance Gatalog, respectively, while the black 
and red histograms represent the resolved sources 
in the same groups. 

The obvious correlation in the data points in the 
top panel (Spearman rank correlation coefficient 
p = 0.753) is a direct consequence of a universal 

Both optimization steps utilized routines from the scipy 
library (http://www.scipy.org). 
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Table 1 

Physical Properties of BGPS Distance Catalog Sources 


BGPS V2.1 Gatalog Properties 



Derived from the PDFs^ 

Gatalog 

1 

b 

Sint° 


d ^ 

M 

R 

n 

Number 

n 

n 

(Jy) 

(") 

(kpc) 

(logMg) 

(pc) 

(log cm“^) 

2235 

7.993 

-0.268 

6.23(0.49) 

74.3 

11 94+®-^® 
ii.yu_o 42 

-0.11 

4 31+0.19 
^•'^-'■-0.16 

901 +0.12 

2254 

8.187 

0.482 

0.61(0.15) 

35.9 

1 64+°-®^ 

1 47 + 0.43 
-0.71 

0 3fi+0-ii 

3 30+0.39 

9.00-0 28 

2256 

8.207 

0.190 

1.44(0.22) 

56.7 


z.ua-o 50 

n 73+0.19 
' '^-0.28 

3 24+®-^® 
'^•^^-0.23 

2261 

8.249 

0.180 

2.29(0.29) 

77.6 


^•^®- 0.30 

1 oq+0.19 
-^•^'^-0.29 

2 Q3+0-19 
^•^'^-0.17 

2262 

8.263 

0.168 

2.82(0.32) 

79.8 


2 43+0.27 
^•^'^-0.38 

i.uy_o 37 

3 (33+0.22 

'^•0'^-0.19 

2265 

8.281 

0.164 

1.00(0.15) 

23.8 

C).UD_o 88 

z.ua-0 32 

0 3fi+0-06 

U.90_q Qg 

4 13+0.19 
^•-'■'^-0.18 

2295 

8.545 

-0.342 

1 .00(0.20) 

35.5 

U.C)U_o 78 

2 3fi+0+6 

z.ou-o 18 

D 73+0-06 

O'-' ^-0.09 

3 44+9-21 
'^•^^-0.22 

2296 

8.551 

-0.296 

0.25(0.10) 


iZ.ZU_Q 4Q 

r\ 7(^ + 0.19 
'^-0.21 



2297 

8.579 

-0.344 

0.86(0.18) 

37.3 


9 9i=;+0.18 

Z.ZO-o 21 

n 70+0.08 

O'- '-J-O.IO 

q q3+0.17 

o.oo-o 16 

2300 

8.669 

-0.406 

0.44(0.11) 



z.yz-o 16 




Note. — This table is available in its entirety in a machine-readable form. 

^ See pT] 

^ The source-integrated A = 1.1 mm flux density, uncertainties in parentheses. 

^ The deconvolved radius of the catalog object, computed using Equation 

^ The appropriate distance estimate from EB15; d for objects near the tangent point and 
otherwise. 



Figure 3. Brief analysis of resolved versus unresolved BGPS 
V2 sources. Top: Scatter plot of deconvolved radius versus 
integrated A = 1.1 mm flux density. Evenly spaced contours 
represent the density of points per 0.1 dex x 0.1 dex bin. 
Only reso lved sources are shown. Plotted in green are Equa¬ 
tion (|17|) evaluated at = 1 kpc (solid), 5kpc (dotted), 
and 20kpc (dashed) (see p.2| for discussion). Bottom: His¬ 
tograms of integrated A = i.i mm flux density. Black repre¬ 
sents all resolved sources (ie., the projection of the points in 
the top panel), and the gray diagonally hashed histogram rep¬ 
resents the unresolved sources. Red horizontally hashed and 
blue solid histograms represent the resolved and unresolved 
populations, respectively, of the Distance Catalog (EB15). 


scaling relations hip in the dense interstellar medium 
first identified by Larson (1981) whereby the mass of 


a molecular cloud structure is rou ghly proportional 
to the square of its radius. Section [5^ explores this 
relationship in more detail, culminating in Equa¬ 
tion ( p!7[ ), which is plotted as green lines in Eigurej^ 
for thr ee differ ent heliocentric distances. 

The |Larson| relationships in the figure extend to 
low flux-density, but since the cataloging process 
truncates source boundari es where the flux density 
meets the local rms value (Rosolowsky et al. 
source masks do not extend to the theoretical zero- 
flux-density isophot. The resulting catalog source 
may have an extent in the map smaller than the 
beam size, leading to a complex solution to Equa¬ 
tion (|6). To further aggravate the situation, the 
beam aeconvolution step assumes that sources are 
spherical (or at least have circular projections on the 
sky), which is invalid for many of the fainter filamen- 
tary structures throug hout the Galactic plane (e.^.. 


Molinari et al. 2010a). Euture cataloging routines 
may be constructed that divide emission between fil¬ 
amentary and compact sources, and may decrease 
the number of BGPS sources with invalid decon¬ 
volved angular extent. Eor the present, however, 
unresolved sources are excluded from the following 
analyses that rely on physical radius or number den¬ 
sity, leaving a sample of = 1369 objects with a 
tabulated physical size. 


4.1.2. Ensemble Physteal Property Distributions 

The ensemble physical property distributions 
from Table are shown in Eigure to illustrate 
the range ofBGPS objects in the Distance Gata- 
log. This subsample is generally representative of 
the entire BGPS catalog, except that detection in a 
molecular transition line that traces dense gas (e.^., 

HCO^(3-2)) is strongly correlated with continuum 
flux density; dim sources are largely absent from the 
Distance Catalog (see EB15 for a complete discus- 
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Physical Radius [pc] 



Number Density 


Figure 4. Physical property histograms for the Distance 
Catalog, computed using the maximum-likelih ood values 
from the Monte Carlo method described in §3.1| The 
gray filled and magenta hashed histograms represent sources 
above the 50% a nd 90% mass completeness levels, respec¬ 
tively (see ^3.2.2|). Top: Source mass distribution. Vertical 
lines mark the locations of the 50% (gray) and 90% (ma¬ 
genta) completeness levels. Middle: Physical radius distri¬ 
bution. The vertical dashed lines iden tify the boundaries be - 
tween cloud / clump / core as used by|Dunham et al.|(|2011|). 
Bottom: Number density distribution; vertical lines as m the 
middle panel. 


sion). In all three panels, sources above the 50% 
(70 Mq) and 90% (400 M^) mass completeness lev¬ 
els are indicated by gray solid and magenta hashed 
histograms, respectively. 

The mass distribution (top panel) peaks around 
100 Mq, but the completeness levels indicated by 
the filled histograms imply that the location of the 
distribution peak is observationally biased. The lo¬ 
cation of the 50% completeness level demonstrates 
that the predicted turnover at low mass is not con¬ 
strained by the current observations. Physical ra¬ 
dius is illustrated in the middle panel of Figure 
with vertical dashed lines marking plausible bound¬ 
aries between molecular clouds (large i?), clumps 
(intermediate i?), and individual cores. We place 
approximate radius boundary markers at 1.25 pc be¬ 
tween cloud and clump and 0.125 pc between clump 
and core, the same boundaries used by Dl l which 
were based on t he general guidelines from |Bergin| 


& Tafalla (2007). These divisions, while rooted in 
physical distinctions based on virial ratio and ca¬ 
pacity for Jeans fragmentation, are flexible and the 
boundaries are easily blurred by the continu um o f 
structure in molecular cloud complexes (see ^5.2), 
and by variations in Galactic environment. Note, 
however, the near complete lack of Distance Catalog 
objects with radii < 0.1 pc). From the relationship 
in Figure smaller sources have lower flux density 


and are therefore less likely to be included in the 
Distance Catalog (EB15). 

Definitions notwithstanding, the typical size scale 
of BGPS-identifled dense molecular cloud structures 
is clearly in the vicinity of 1 pc and may encom¬ 
pass multiple true but unresolved molecular cloud 
clumps. The mass completeness subsets illus trate 
the effects of Malmquist bias (coupled with the Far- 
|son| relationship connecting mass and physical ra- 
dins) whereby the 90% complete sample is nearly 
all cloud-scale (i? > 1 pc) objects, and the BGPS 
does not detect any large {R> 2 pc), lower-mass ob¬ 
jects (M < 400 Mq). This places a rough constraint 
on the minimum number density (n > 230 cm“^. 
Equation]^ detectable by the BGPS for large-scale 
objects. 

The bottom panel of Eigure depicts the num¬ 
ber density distribution of resolved sources, which 
spreads across a wide dynamic range. Approxi¬ 
mate number density boundaries at n = 750 cm“^ 
between cloud and clump, and n = 10^ cm“^ be¬ 
tween clump and core (Dll) are indicated by verti¬ 
cal dashed lines. Whereas there is nearly complete 
overlap between high-mass and large-radius objects 
(‘clouds’; middle panel), there exist a handful of 
low-mass (M < 70 ) objects with low computed 

number density (‘clouds’; bottom panel). These ob¬ 
jects, along with the spread of higher-mass objects 
(M > 400 Mq) across classifications in the bot¬ 
tom panel insinuate that number density computed 
via Equation Q for BGPS sources is not a reli¬ 
able indicator oi physical classification. It is possi¬ 
ble that non-uniform geometry of sources is causing 
the spread of the completeness samples over a large 
range, or that the algorithm by which the cataloging 
routine decomposes emission into discrete sources is 
doing so in a manner inconsistent with the underly¬ 
ing physical structure in molecular clouds (e.^., by 
assuming spherical clouds when computing A/'(H 2 )). 

4.1.3. Physical Properties as a Function of 
Heliocentric and Galactocentric Position 


A major challenge in characterizing the ensemble 
physical property distributions of detected molecu¬ 
lar cloud structures is separating observational and 
data-processing systematic effects from underlying 
physical relationships, as systematic effects on the 
measurement of physical properties can lead to sig¬ 
nificant biases in the derived quantities. Under¬ 
standing the physics of dense molecular cloud struc¬ 
tures requires well-constrained distance estimates. 
It is instructive, therefore, to analyze the functional 
dependences. 

The left panels of Eigure illustrate the depen¬ 
dence on heliocentric distance of the mass, physical 
radius, and number density for sources in the Dis¬ 
tance Gatalog. Sources are plotted as gray dots, 
with the median value per 2-kpc bin marked by 
black squares. The systematic dependence on helio¬ 
centric distance can be clearly seen in the median 
values as M R r\j and n r\j follow¬ 

ing Equations (0, (§, and Q, respectively. Eor 
the distribution of mass (top l3t panel), the hashed 
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Figure 5. Physical properties as a function of heliocentric 
distance and Galactocentric radius. Grey dots mark indi¬ 
vidual objects from the BGPS Distance Catalog, and black 
squares identify the median values in 2-kpc {d^) or 1-kpc 
(-Rgal) bins. Top Row: Computed mass for each object. The 
hashed region represents the mean < 5cr flux-density com¬ 
pleteness level as a function of heliocentric distance. For 
the right panel, this region is computed for Rg^l along the 
i = 0°,180° line. Horizontal dashed lines mark the 50% 
and 90% completeness levels (as in Figure]^. Middle Row: 
Physical radius. The lower hashed region corresponds to 
6jl = 10", while the upper marks Or = 150". For the 
right panel, the lower region is again computed for Rg^x along 
^ = 0°, 180°. Horizontal dashed lines mark the ^proximate 
cloud / clump / core boundaries (as in Figure®. Bottom 
Row: Number density. Horizontal dashed lines as above. 


region identifies the mass associated with the mean 
< 5(7 flux dens ity co mpleteness level (S'compiete = 
0.30 Jy, Section 3.2.2). Especially given the variable 
rms noise from held to field in the BGPS, sources 
may be detected in this region but completeness is 
not assured. The horizontal dashed lines mark the 
50% and 90% completeness levels (as in Eigurej^. 
The large collection of sources around = 5kpc 
has a median mass ~ 300 , whereas the median 

grows to « 2000 Mq for objects > lOkpc. 

The physical radius plot (middle left panel) shows 
similar effects, with the hashed regions marking 
Or < 10" (typical minimum angular radius) and 
Or > 150" (approximate maximum recoverable ra¬ 
dius; see G13). The horizontal dashed lines repre¬ 
sent the R = 0.125 pc and R = 1.25 pc divisions 
between physical object categories discussed above. 
Einally, the number density distribution (bottom 
left panel) displays the expected trend, except that 
the median values remain somewhat constant be¬ 
tween 500 cm“^ ^ n < 1000 cm“^ over a large 
range of heliocentric distance. Since this flatten¬ 
ing is not observed for physical radius (which also 
depends linearly on ), this range of n may repre¬ 


sent a typical value for BGPS dense molecular cloud 
structures, subject to the geometric and cataloging 
caveats discussed above. 

The right panels of Eigurej^ illustrate the depen¬ 
dence of these physical properties on Galactocen¬ 
tric radius. The vertical dashed line identifies our 


adopted Rq = 8.34 kpc (|Reid et an2014|) for refer- 

id] 


ence. Black squares mark the median values in 1 
kpc bins, and roughly show the systematic effects of 
heliocentric distance (z.e., objects far from Rgai = 
Rq are at large d^). Gaps in the black function 
represent bins with N < 20 sources, where the me¬ 
dian value is skewed by outliers. The gap in sources 
(gray dots) at Rgai = 8.5 — 9.5 kpc is pointed out 
in EB15 as arising from a unique combination of 
BGPS coverage region and a kinematic avoidance 
zone (where kinematic distances are unreliable in 
the face of non-circular motions about the Galac¬ 
tic center), and is not likely a true feature of the 
underlying Galactic dense molecular gas distribu¬ 
tion. Many of the dense molecular cloud structures 
observed in the Molecular Ring / Scutum-Centarus 
Arm feature at Rg^i = 4 —5 kpc, d^ = 3 —6 kpc, cor¬ 
respond the physical properties typical of molecular 
cloud clumps. 

The systematic effects of heliocentric distance will 
obscure some of the underlying physical trends in 
properties as a function of Galactocentric radius. 
Eor instance, the spike in median number density 
near Rgai = Rq is due to the population of nearby 
{d^ < 2 kpc) core-scale objects. The observed de¬ 
crease in mass for dense molecular cloud structures 
from Rgai = 4 kpc to Rgai = 7 k pc is consistent with 
the tre nd observed for GMCs by |Roman-Duval et al. 


(2010), but is likely primarily an observational bias. 


4.2. Mass Function Fits 
4.2.1. Mass Distribution Trials 

The full information available from the MPDEs 
(and ultimately the DPDEs from EB15) allow for 
the utilization of a large number of Monte Carlo 
trials to fit functional forms to the dense molecular 
cloud structure mass distribution. Eor each trial, a 
single value was drawn from the MPDE of each ob¬ 
ject (p.l.l|) to create an independent realization of 
the mass dist ributio n of the Distance Catal og. Both 
a power law (^3.3. 1|) and lognormal (^3.3.2|) function 
were fit to each realization, and the ht parameters 
recorded. This process was repeated N = 20, 000 
times, a value chosen to balance appropriately sam¬ 
pling the parameter spaces for the functional fits 
with computation time. The collected fit parame¬ 
ters from the Monte Carlo process were then plot¬ 
ted as two-dimensional histograms to identify joint 
confidence intervals. Belo w, we discuss the results 
of the power-law (^4.2.2|) and lognormal (^4.2.3) 
fits to the entire data set. Attempts to subdivide 
the Distance Catalog into astrophysically meaning¬ 
ful subsets are described in Appendix [A| but most 
fits resemble those of the entire sample. Maximum- 
likelihood fit parameter values for all fits are listed 
in Table O 
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Figure 6. Power-law fit to the aggregate mass distribution. Left: The complementary cumulative distribution function 
(CCDF, solid black curve) and differential mass distribution (gray histogram) for the Distance Catalog. This distribution is 
the aggregate of 20,000 Monte Carlo realizations of the mass function, normalized to a sum of N = 1710 to create an average 
mass distribution that marginalizes over the uncertainties in the mass of each object. The solid cyan and dashed black curves 
represent the maximum-likelihood power-law index (from right panel), and the vertical gray dashed line marks the maximum- 
likelihood minimum mass for which a power law describes the data. Right: The two-dimensional histogram of the power-law fit 
parameters for each of the 20,000 Monte Carlo trials, whereby a power law was fit to a single realization of the mass distribution. 
Contours enclose 68.3% (highest), 95.4%, and 99.7% of the points in the plane. The yellow X marks the largest concentration 
of points, or the joint maximum-likelihood values of a and Mmin- 


4.2.2. Power-Law Fit 

The collected power-law fits to the complete mass 
distribution are shown in Figure The left panel 
illustrates the mass distribution both as the comple¬ 
mentary cumulative distribution function (CCDF, 
solid black]^ and as the differential mass distribu¬ 
tion (gray histogram). Both distributions represent 
the aggregate of 20,000 Monte Carlo realizations of 
a single mass distribution, normalized so the sum 
equals 1,710 objects. To smooth the effects of small 
sample size and utilize the full amount of informa¬ 
tion encoded in the collected set of MPDFs, the 
aggregate is shown rather than any single one of 
the Monte Carlo trials. The power-law fits (derived 
from the data in the right panel) are shown for the 
CCDF (cyan line) and the differential mass distri¬ 
bution (black dashed line). 

The power-law fitting algorithm (PLFIT) returns 
two parameters, the power-law index a and the min¬ 
imum mass over which the data may be described 
by a power law (Mmin)- The two-dimensional his¬ 
togram of the 20,000 parameter pairs from the 
Monte Carlo trials are shown in the right panel of 
Figure Color scale intensity indicates the num¬ 
ber of points in each pixel of the histogram, and 
contours enclose 68.3% (highest contour), 95.4%, 
and 99.7% of the parameter pairs, meant to illus¬ 
trate joint confidence regions. The high degree of 
correlation between a and Mmin is indicative of the 
continuously steepening nature of the mass distri¬ 
bution shown in the left panel (where a power law 

The CCDF is simply 1 - CDF, where CDF is the 
typical cumulative distribution function. The CCDF mea¬ 
sures the probability of finding an object with mass greater 
than a given value, whereas the CDF describes the probabil¬ 
ity of finding an object with mass less than a given value. 
The CCDF is used to illustrate the distribution in a man- 
ner robust against fl uctuations caused by finite sample size 
( [Clanset et al.|2009| . 


is a straight line). The yellow X in the right panel 
marks the peak of the two-dimensional histogram, 
or the joint maximum-likelihood parameter values 
(d = 1.95, Mmin = 966 Mq). This minimum mass 
i s abo ve the 90% completeness level described in 
^3.2.2 (^ 400 Mq), meaning the power-law fit is not 
atfected by Malmquist bias, although other selection 
effects may be at play. Note the secondary peak of 
points near a = 2.3, Mmin = 3000 M^ in the right 
panel; there appears to be a sharp steepening of the 


mass distribution at the high-mass end (see ^5.3.2 
for discussion). The distribution truncates around 
Mmin = 3500 Mq due to a requirement in the fitting 
algorithm for a minimum number of data poi nts re¬ 
quired for a robust fit (seejClauset et al.|2009|). This 
truncation is a general feature of the astrophysical 
subset fits discussed in Appendix \A\ 

4.2.3. Lognormal Fit 

The continuously steepening nature of the mass 
CCDF makes fitting or interpreting a power law 
function for the entire data set difficult at best. Fur¬ 
thermore, distance inhomogeneities in large-scale 
surveys such as the BCPS tend to drive the compos¬ 
ite o bserve d mass distribution toward a lognormal 
(see ^5.3.3). We applied our lognormal fitting rou¬ 
tine to the A" = 20, 000 Monte Carlo realizations of 
the mass distribution, and the resulting fit parame¬ 
ter distributions are shown in Figure The CCDF 
and differential mass distributions in the left panel 
are identical to FigmeM (left). The functional fits 
are drawn from the ri^t panel as was done with 
the power law, where the cyan line fits the CCDF 
and the dashed black line fits the differential mass 
distribution. 

Whereas the power-law fit returns two parame¬ 
ters, the lognormal fitting optimizes the functional 
fit over four parameters: the lognormal character¬ 
istic [/i,cr], and also both ends of the mass range 
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Figure 7. Lognormal fit to the aggregate mass distribution. Left: The CCDF (solid black curve) and differential mass 
distribution (gray histogram) are identical to Figure]^ The blue and black dashed curves represent the maximum-likelihood 
lognormal fit (from the right panel), and the vertical gray dashed lines mark the maximum-likelihood minimum and maximum 
mass for which a lognormal describes the data. Right: The two-dimensional histogram of the lognormal fit parameters [/r, a] 
for each of the 20,000 Monte Carlo realizations of the mass distribution. Contours enclose 68.3% (highest), 95.4%, and 99.7% 
of the points in the plane. The yellow X marks the largest concentration of points, or the joint maximum-likelihood values. 
The inset shows the parameter space distribution for the pair [Mmin? Mmax]- 


[Mmin, Mmax] ovoi which the lognormal fit is valid. 
For computational expediency, we divided the fit¬ 
ting into a nested st ructure following the recipe 
from Olmi et al. (2013, Appendix B). The lognormal 
characteristic is fit for a given pair of mass range val¬ 
ues, then the mass range is adjusted to find the best 
match between the model CCDF and the CCDF 
constructed from the source masses. Shown in the 
right panel of Figure are the distributions of log¬ 
normal best-fit parameters paired in this fashion. 
The more fundamental is the [/i, cr] plane, shown 
in the background, since these define the position 
and shape of the function. As with the power- 
law fit, the two-dimensional histogram of the 20,000 
Monte-Carlo trial fit parameters is shown, with con¬ 
tours representing the same enclosed fractions of 
points. Both parameters are tightly constrained, 
with ft = 5.301o‘ 20 (160^0 ^ ^ 230 M^) and 

a = 2 . 00 ^ 0.08 5 where the uncertainties represent 
the marginalized 68.3% confidence intervals. The 
joint maximum-likelihood values (marginalized over 
[Mmin, Afmax], yellow X) occur at /i = 5.30, a = 
2.01, and are reflected in the solid cyan and dashed 
black curves in the left panel. Since the peak occurs 
at = 200 Mq, below the 400 90% 

completeness level, it may be biased and should be 
viewed as an upper limit. 

Inset in the right panel is the two-dimensional his¬ 
togram of the [Mmin, Afmax] plauc, showiug the rela¬ 
tionship between the bounding parameters. There 
is scant correlation, except that the spread along the 
diagonal from top-left to bottom-right represents 
expansion and contraction about a central value. 
The optimum values of these parameters (marginal¬ 
ized over [/i, cr]) are indicated by the yellow X in the 
inset (Mmin = 70 M^, M^ax = 12000 M^), and 
are marked in the left panel by vertical gray dashed 
lines. This minimum mass extends down to the 50% 
mass completeness level (~ 70 M^), and its location 
may not be entirely reliable, as the peak of the fit¬ 


ted lognormal function always falls below the 90% 
completeness level (/i < 6.0 = In 400 M^). 

Comparison of the lognormal fit in Figure with 
the power-law fit in Figure yields some interest¬ 
ing notions. As expected, the minimum mass for 
the power-law fit is well above the peak of the log¬ 
normal distribution. In fact, the 99.7% contour in 
Figm:e^{right) terminates at about this value. Fur¬ 
thermore, the curvature of the mass distribution is 
well-matched to a lognormal function over nearly 
2.5 orders of magnitude. The continual curve of 
the mass distribution may be an effect of obser¬ 
vational and catal oging biases against the highest 
mass sources (see ^5.3.2). 


4.2.4. Mass Functions of Astrophysically Motivated 
Subsets 

Theories of star formation hold that a strong 
power-law behavior should be evident for dense 
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clear single power law fit to the complete Distance 
Catalog prompted us to fit functional forms to as¬ 
trophysically motivated subsets of the data to iden¬ 
tify homogenous sets of objects predicted by the¬ 
ory. While the fits to many of the subsets, did not 
reveal any strongly compelling behavior (see Ap¬ 
pendix ffl, there are two whose power-law fits are 
fairly well constrained. The set of nearby objects 
(<^0 < 6.5 kpc) may be described by an d = 2.0T0.1 
power law, and set of criteria aimed at isolating 
molecular cloud clumps (2 kpc < < 10 kpc and 

M > 300 Mq) is fit by a d = 1.9 ± 0.1 power-law. 
The heliocentric distance cut removes cloud-scale 
objects that are more strongly affected by the an¬ 
gular filtering function of the BGPS data pipeline, 
leaving principally molecular cloud clumps, similar 
to the mixed criteria. 
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Table 2 

Mass Distribution Functional Fit Parameter Values 





Power-Law Fit 


Lognormal Fit 


Gut 

Subset^ 


6l 

Mmin 

A 

a 

Mn,in° 

M 0 

2v^max 

Type 

Name 

A 


(log Mg) 

(In Mg) 

(In Mg ) 

(log Mg) 

(log Mg) 


Distance Gatalog 

1710 

1 Q4+0-34 
-'■•^^-0.10 

z.»O-0 06 

O.C)U_Q 20 

z.uu_o 08 

1 8/1+0-08 
1-°4-0.16 

4.08l°;24 


Nearby 

1328 


2 79+0-20 
'^-0.19 

4 «3+0-15 

4.00-0 08 

1 f^9+0.08 

'^-0.02 

1 84+0-12 

1-®4-0.12 

4.14+^^ 

Distant 

382 


q qq + 0.01 

O.OO-O 29 

7 10+0-05 
+ 1C 0.05 

1 /LS+0-08 

-*--^°-0.08 

1 84+0-12 
-*-•^^-0.08 

4 14+0.24 
^•1^-0.18 

-^gal 

Set-Gen Arm 

866 

1.981°;“ 


0.00-0 25 

i.aU-0 20 

1 84+0-12 
-*-•^^-0.08 

3 9C+0-30 


Sgr / Local Arms 

610 

9 09+0.02 

z.uz_o 28 

0 iq+0.04 
'J--La-0.42 

4 90“^®'®® 
^•^'-'-1.08 

q 00 + 0.52 
^•«^-0.64 

1 84+0-12 
-*--°^-0.08 

3 9C+O-I8 

9.yo_Qj^g 


Sgr Arms 

330 

1 Q7+0-01 

-L-y / _0.24 

q 1 7+0.03 
'^•-^'-0.35 

08 

q Qq + 0.52 
^•«^-0.64 

1 84+0-12 
-*-•^^-0.08 

3 9C+O-I8 
c).yo_Qj^g 


Per / Outer Arms 

234 

1 Q4+0-03 

2 >5fi+0-04 

^•O0 o.l3 

3 40+0-23 
^•^^-0.15 

1 70+0-06 

1- ''0 o.22 

1 84+0-12 

1-°4-0.16 

3 9C+0-30 
o.»U-0 18 

R 

Glump / Gore 

807 

i.yu_o 06 

0 9^+0.01 

'^•^^-0.44 

■5 Q9+0-25 

a.oZ-0 32 

9 1 9+0.26 
^•1^-0.18 

1 84+0-12 

1-°4-0.16 

4 14+0-18 
^•1^-0.24 


Gloud 

562 

1 ^icTO.05 

1.8o 0 24 

z.yu_Q 69 

ix 915+0-30 

0.^0-0 55 

9 99+0.34 

^•^^-0.24 

1 84+0-12 

1-®4-0.16 

4 14+0.36 
^•1^-0.18 

n 

Glump / Gore 

993 

1 07+0.17 
' -0.10 

2 91+0-41 

^•^^-0.10 

p: qc:+0.15 

9 99+0.26 
^•^^-0.20 

1 84+0-12 

1-®4-0.12 

4 14+0-12 
^•1^-0.24 


Gloud 

376 

' °-0.06 

q nq+0.02 

O.UO-o 48 

. q.+0.27 

O.OO-o 55 

9 1 9+0.26 
^•-^^-0.38 

1 84+0-12 

1-°4-0.16 

4 14+0-24 
^•1^-0.24 

S 

Mean < 120 Mq pc“^ 

1444 

9 qo+0.03 
Z.OO_o 36 

'^•'04^-0.38 


1 C^i+0-04 
l.b8_o 10 

1 84+0-12 
-*--°^-0.20 

4:.UU_o 24 


Mean > 120 Mq pc“^ 

358 

Z.UO_o 38 

3 44+0-04 
'^•^^-0.45 

+ ^U_o 05 

1 79+0.18 
'^-0.08 

1 84+0-16 
-*-•^^-0.08 

A 1 O+0.30 
^•1®-0.18 


Peak < 120 Mg pc-^ 

392 

9 1 ir+0.01 

^•-'-^-0.25 

2 14+0-01 

q (39+0.85 

O.UZ-o 20 

1 70+0-02 

1- 40 

1 84+0-12 

1-°4-0.16 

4:.UU_o 06 


Peak > 120 Mg pc-^ 

1410 

1 QQ+0.36 

l.»0_o 08 

3 47+0.01 

5 83+0-07 
^•°^-0.12 

1 5^n+0-io 

1.8U-0 08 

1 84+0-12 

1-®4-0.12 

A 1 Q + 0. 18 

^•l°-0.24 

^'lsr Type 

Dense Gas 

1372 

J..ao_o 08 

q (37+0.36 
'^•'0'-0.07 

3 39+0-13 
0-'4^-0.15 

j-.aU-o 08 

1 84+0-12 

l-®^-0.12 

4 14+0-24 
^•1^-0.18 


13CO 

422 

1 97+®-®^ 
-0.08 

^•O^-0.17 

4 33+0-22 
4.OO_0 28 

1 79+0.14 

1- '^-0.04 

l-®^-0.20 

o.no-o 06 

“Protocluster” 

Blind {i < 90°) 

1513 

1 94+0-34 
-^•^^-0.09 

2 97+0-47 
-0.02 

3 OO+0.15 

o.oo_o 22 

2-08l°;l« 

l-®^-0.12 

^•1^-0.24 


R > 0.2 pc 

1331 

-^•^^-0.12 

2 99+0-19 

z.»»-0 20 

5-30l°;i« 

2-16+.M 

l-®^-0.12 

^•1^-0.24 


Mixed Protocluster 

579 

-^•^^-0.07 

2 79+0-21 
'^-0.13 

6.45l°;?« 

0.66l°;t° 

9 1 9+0.24 
^•1^-0.12 

Z.nO-o 24 


M > 2000 Mq 

216 

9 90 + 0.03 

Z.ZO-0 06 

3 415+O-O2 

Q qQ + 0.05 
0.00-0 02 

0.82l°“ 

1 80+0-04 
i.OU-o 04 

4 30“^®'^® 
n.ou_Qj^2 


^ See Appendix [A] for definitions of these subsets. 

^ The lognormal fitting routine tends to find mass bounds very near the initial guess (Mmin = 10^ ? ^max = 10^ 

but parameter correlations show that the exact bounds have little impact on the [/i, d] returned. 


The overall lack of clean fits to either the com¬ 
plete Distance Catalog or most of the subsets indi¬ 
cates that the sample is largely homogenous and the 
interstellar medium is not cleanly, tightly described 
by simpl e rel ations. These effects are discussed in 
depth in §5.3| The results of the studied astrophysi- 
cal subsets are presented in Appendixalong with 
the accompanying figures similar to Figures an d0 
The fit results are all shown in Table [2j 


4.3. The Distribution of Dense Moleeular Gas in 
the Galaetie Plane 

4.3.1. Face-On View of the Milky Way 


Ellsworth-Bowers et al. (2015, their Figure 15) 
presented a map of the locations of dense molec¬ 
ular cloud structures in the Galactic disk with re¬ 
spect to a rendering of Galactic structure based on 
Spitzer data. Since these objects smn several or¬ 
ders of magnitude in mass (Figure]^, their loca¬ 
tions alone do not necessarily indicate the mass dis¬ 
tribution of dense molecular gas. To estimate the 
Galactic disk mass surface density of the objects in 
the BGPS Distance Gatalog, we constructed a grid 
consisting of 0.25 kpc x 0.25 kpc bins into which 
masses computed for individual sources were placed. 
Since typical distance uncertainties are ^ 0.5 kpc, 
using 0.25 kpc bins approximately Nyquist samples 
the data. Taking advantage of the information con¬ 


tained in source DPDFs, we produced a map (Fig¬ 
ure]^ left) by randomly drawing 10^ distances for 
each objecF and adding the mass derived using 
Equation (|^, divided by 10^ to reflect the prob¬ 
abilistic contribution of that mass to the appro¬ 
priate grid cell. This use of the full DPDE leads 
to some of the smearing effects along the line of 
sight towards isolated regions in the outer Galaxy 
(e.^., i ^ 110°, 130°), where peculiar motions have a 
larger effect on kinematic distances. Eurthermore, 
given that BGPS coverage for i > 90° is neither 
blind nor contiguous, lack of mass in Eigure in 
the outer Galaxy should not be construed to imply 
a void in the Galactic plane. 

Mass surface density contours are plotted atop an 
image of the Galaxy produced using Spitzer near- 
infrared data (R. Hurt: NASA/JPL-Galtech/SSG) 
in the right panel of Eigure ^ As with the corre¬ 
sponding map from EB15, there are many caveats 
in the interpretation of this image. Eirst, there are 
virtually no BGPS sources with well-constrained 
distances at Rgai < 4 kpc owing to non-circular 
motions induced by the Galactic barj^ with the 

The DPDF formalism relies primarily upon kinematic 
distances whereby velocity measurements of dense molecu¬ 
lar gas tracers are mapped onto Galactic position using a 
rotation curve assuming circular motion about the Galac¬ 
tic center. The Galactic bar induces strong radial streaming 
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Figure 8. Dense gas mass surface density over the Galactic disk for Distance Catalog sources. Left: Color scale and contours 
indicate the dense gas surface mass density (xlO^ Mq kpc“^) in each 0.25kpc x 0.25kpc pixel. The Sun and Galactic Center 
are identified by stars at (a^gab ygal) = (O 5 —8.34) kpc and (0, 0) kpc, respectively. Right: The same mass surface density contours 
plotted atop an artist’s rendering of the Milky Way based on Spitzer data (R. Hurt: NASA/J PL-Caltech/SSC). S everal of the 
mass concentrations correspond to prominent star-formation complexes originally discovered by|Westerhout| (|1958| marked with 
red ‘X’s). As discussed in EB15, we have excluded from consideration of much of Rgai < 4kpc tor kinematic reasons; sources 
appearing inside this circle have robust distance estimates through association with a trigonometric parallax measurement. 


exception of objects associated with trigonomet¬ 
ric parallaxes measurement of maser transitio ns in 
regions of high-mass star formation {e.g., [Reid 


et al. 2014). Second, given that the typical uncer¬ 
tainty loFwell-constrained distance estimates from 
EB15 is ^ 0.5 kpc, the map and contours have a 
^ Ikpc effective resolution, significantly coarser 
than the underlying image, although commensurate 
with Herschel m aps of nearby galaxies (mean dis¬ 
tance ^ 10 Mpc; |Kennicutt et al.||2011|). 

Third, there is a signihcant concentration of mass 
near (xgai,%ai) = (—4,3) kpc that appears to he 
in a void in the underlying image. As discussed 
in EB15, this could represent either molecular gas 
at that location not identified in the Spitzer data 
(which is a “best guess” picture of the Galaxy), or 
sources in the W43 star-forming complex at the end 
of the Galactic bar erroneously placed at the far 
kinematic distance. Gomparison of Eigurej^with 
the Galactocentric locations of GMGs in th^^GO 
Galactic Ring Survey (GRS; | Jackson et al.||2006|) 
reveals a similar large collection o l gas at this lo- 
cation (|Roman-Duval et al. |2010, their Eigure 5). 
Most of the BGFS sources in this region are as¬ 
sociated with one or more H II regions with robust 
KDA resolution from the H II Region Discovery Sur- 

motions, rendering regions under its infiuence unusable for 
kinematic distance measurement. 


veys (HRDS; |Bania et al.||2010| |2012|) and the dis¬ 
tances to GRS clouds were primarily resolved using 
the same H I absorption techniques as the HRDS 
(EB15). 

The BGPS-GRS correlation implies either the ex¬ 
istence of a large collection of mass in an apparently 
empty region of the Spitzer image or a fundamen¬ 
tal limitation in the application of H I absorption 
techniques for distance resolution. Ev idence in sup¬ 
port o f the former comes from both Russeil et al.| 
(2011), who find interarm clumps for this region 
in the Hi-GAL i = 30° SDP field, and a study of 
the M51 spiral galaxy that reveals a popula tion of 


GMGs dow nstream of a major spiral arm (Egusa 
iniI][MI ). Eurthermore, the appearance of inter- 


arm gas could also be the effect of a spiral feature 
possessing a systematic noncircular velocity, biasing 
kinematic distances. 

Identified in the right panel of Eigure are 
the high-mass star-forming regions W43, W51 and 
W49, which stand out in the mass diagram as high 
surface density areas. Despite the rough tracing of 
some spiral structure, this map is not a definitive 
distribution of dense molecular gas in the Milky 
Way. Eirst of all, objects used to produce this map 
represent only 20% of the full BGPS catalog (by 
number, although 33% by flux density). Second, 
given the coarse resolution of the map, any con- 
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Figure 9. Vertical and radial mass distributions. The color- 
scale image in the center represents the azimuthal inteCTation 
of mass, computed in analogous fashion to Figure left. 
Since bins at larger Galactocentric radius encompasslarger 
volumes, the absolute value of the color-scale is not physi¬ 
cally meaningful, but rather represents the relative concen¬ 
trations of mass. At top is the mass distribution versus Rg^li 
marginalizing over vertical position, and at right is the dis¬ 
tribution versus z, marginalizing over Galactocentric radius. 
The fit to the mass-weighted z histogram is consistent with 
the number-count histogram from EB15. 


elusions about detected spiral structure should be 
used cautiously as the map serves to show where 
the detected mass lies, and not a complete census 
of mass in the Galaxy. The gap observed between 
60° < i < 75°, however, is real and correspon ds to a 
similar minimum in ^^GO column seen in the [Dame] 


et ah (|2QQ1 ) maps. The expansion of robust dis¬ 


tance" determination to other dust-continuum sur¬ 
veys, however, will have high utility for tracing the 
Galactic dense molecular gas distribution across the 
entire disk. 


4.3.2. Vertical and Radial Distributions of 
Star-Forming Mass 

The azimuthally integrated vertical and radial 
distributions of the mass of star-forming clouds pro¬ 
vide an alternate view of Galactic star-formation 
distribution, and are shown in Figure The Galac¬ 
tocentric radius and vertical position were com¬ 
puted using the (^, (i7gab conversion 

matrix from Appendix G of Ellsworth-Bowers et al.| 
(2013), which accounts for the 25 pc vertical sol_^ 
oltset above the Galactic midplane. As in Figure 
masses and Galactocentric positions were derived 
from 10^ independent samples from the DPDF for 
each object. The color image at the center rep¬ 
resents the amount of mass in each 0.25 kpc (ra¬ 
dial) X 10 pc (vertical) bin. Since this is an az¬ 
imuthal integration, the color scale intensity of the 
image is physically meaningless other than to iden¬ 
tify overdense regions; pixels at larger i?gai repre¬ 
sent a significantly larger volume than those nearer 


the Galactic center. Identified in the image are the 
W43, W51, and W49 regions that stand out in the 
face-on image of the Galaxy. These three regions 
dominate the mass distribution, but there are still 
clear collections of mass in the 4-5 kpc range, and 
around 6 kpc. The warping of the disk to large 
i?gai is visible in the mass distribution, but we cau¬ 
tion that BGPS observations in the outer Galaxy 
{i > 90°) were targeted towards regions of known 
star formation, and do not represent the same blind 
sample as the inner Galaxy coverage. 

To the right of the image is the marginaliza¬ 
tion over i?gai, or the the vertical histogram of the 
mass distribution, shown as total mass per bin. A 
Gaussian fit to the this mass-weighted histogram 
(/i = -2.4 ± 1.8 pc, FWHM = 63.7 ± 3.6 pc) is 
consistent with the distribution derived from the 
number-count histogram from EB15. This implies 
that the mass is roughly evenly distributed amongst 
catalog sources regardless of vertical position, and 
that there is no particular bias in the mass of dense 
molecular cloud structures near the Galactic mid¬ 
plane. 


5. DISCUSSION 
5.1. What is a BGPS source? 

Based on a subset of BGPS objects throughout 
the Galactic plane observed in the l owest inversion 
transitions o f NH 3 around 24 GHz, [Dunham et al.| 
( 2010 , 2011 ) began a discussion about the nature 
of BGPS sources. Using the larger sample of ob¬ 
jects with well-constrained distance estimates from 
EB15, we continue this discussion. 

Observationally, dense molecular cloud structures 
are typically defined in terms of cloud-, clump-, and 
core-scale objects. The BGPS is generally sensitive 
to clump-scale objects, but we investigate to what 
extent clouds and cores are present in the data. 
Eigure illustrates the relative concentrations of 
non-clump objects at various heliocentric distances, 
and is modeled after Dll (their Eigure 21 ). Gom- 
parison between the two figures yields insight into 
the effects of using a larger sample. The thin black 
lines in Eigure [To] mark the fraction of sources closer 
than a given distance with properties consistent 
with molecular cloud cores based on physical radius 
(solid; R < 0.125pc), mass (dotted; M < 27.5 M^), 
and num ber de nsity (dashed; n > 10 ^cm“^), as de¬ 
fined in p.l.2| Many (> 70%) of the objects at 
<1 kpc fall under the molecular cloud core def¬ 
inition, and conversely, the majority of core-scale 
objects are seen at < 2 — 3 kpc. This is commen¬ 
surate with the completeness and angular transfer 
functions illustrated in Eigure 

Similarly, thick blue lines in Eigure mark the 
fraction of sources located farther than a given dis¬ 
tance that are characterized as clouds {R > 1.25 pc, 
M > 750Mq, and n < 750cm“^). Of particular 
note is the disparity in the distributions between 
different definitions for GMG-scale objects. Beyond 
6 kpc, > 90% of objects in the Distance Gatalog 
have large physical radius. As found by Dll for 
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Figure 10. Fraction of source type as a function of helio¬ 
centric distance. Shown in thin black lines is the fraction of 
sources in the BGPS Distance Catalog nearer than the in¬ 
dicated heliocentric distance that are cores, and thick blue 
lines indicate the fraction of sources farther that are clouds. 
Line styles indicate core and cloud divisions as determined 
by physical radius (solid), mass (dotted), and number density 
(dashed). Both ends of the distance range suffer from small 
number statistics. (This figure is styled after Dll, their Fig¬ 
ure 21.) 


the NHs-observed subset, the density curve never 
reaches 90% because there exists a substantial frac¬ 
tion of high-density sources in the distance range 
6 kpc < < 13kpc (see Figure!^ bottom left). 

Furthermore, the geometrical uncertainties in both 
the cataloging process and derivation of physical 
properties decrease the reliability of this quantity 
for object classification. The combination of im¬ 
proved cataloging routines with higher fidelity to 
the underlying structure and future high angular 
resolution studies of the Galactic plane {e.g., with 
CCAT) may alleviate this disparity. One princi¬ 
pal deviation from the Dll results, however, is the 
mass fraction of cloud-scale objects as a function 
of distance. Whereas for the NH 3 subset the cloud 
fraction by mass closely tracked the radius fraction, 
in Figure the cloud fraction by mass is signifi¬ 
cantly smaller. This may be due either to the tar¬ 
geted nature of the Dll observations or differences 
between the BGPS version 1 and version 2 catalogs 
(see G13). 

With the enlarged set of physical properties pre¬ 
sented here, we confirm the previous conclusions 
of studies of BGPS objects that the majority of 
sources detected are clump-scale objects in the dis¬ 
tance range 2 kpc ^ < lOkpc, cores nearer by, 

and cloud-scale objects beyond. Uncertainties in 
the geometry of dense molecular cloud structures 
remains one of the most significant issues in terms 
of understanding the number density distribution 
and its role in the evolution of dense regions. 

5.2. Larson’s Laws & The Continuum of 
Strueture 


Figure 11. The mass surface density - mass (E — M) re¬ 
lationship for molecular cloud structures. Black dots iden¬ 
tify BGPS Distance Gatalog objects from Table while 
cyan squares mark the values for GRS GMGs from ^oman-| 
|Duval et al.| (|2010| labeled RDIO) and orange trian gles the 
VlKb structures discussed in|Battisti & Heyer| (|2014| labeled 
BH14). The black, cyan, ana orange lines plot tiie respective 
power-law fits to the individual data sets, and the red line 
corresponds to the power-law fit fo r the combined data. The 
original result from [Larson] (|1981|) is shown as a thick gray 
dot-dashed line. Fit parameters for each line are listed in 
Table |3] 


5.2.1. The Mass-Radius Relationship 

The physical properties of molecular clouds and 
their substructures appear to follow a series of scal¬ 
ing relationships known as Larson’s Laws, following 
their intr oduction in a l iterature study of molecular 
clouds by Larson (1981). The most frequently used 
of these scaling relationships is the size-linewidth 
relationship, which holds that virial and/or turbu¬ 
lent motions across a molecular cloud structure in¬ 
crease with the square root of the size of that struc¬ 
ture. This relationshi p is valid for single molecu¬ 
lar line tra cers (e.^., Solomon et al. 1987 Heyer 
et al.||2009|), but the linewidths of ditterent molecu 
lar transitions observed in the same dense molecu¬ 
lar cloud structure may vary by an order of magni¬ 
tude. The heterogenous nature of the spectroscopic 
data gathered for BGPS distance estimation there¬ 
fore prevents their use in deriving such a relation¬ 
ship. 


The first two Larson relationships (the size- 
linewidth and mass-linewidth relationships) may be 
combined to form a mass-size relationship, some¬ 
times called Larson’s 3^*^ Law|^ Using the physical 
radius = I //2 rather than the overall length of the 
molecular feature for consistency with the remain¬ 
der of our analysis, we derive 


M 

'mI 


= 460 



(15) 


This name is also given to the number density-size re- 
lationship similarly derived from the first two [Larson] ([1981[) 
relations. 
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Table 3 

Power Law Fit Values for Larson-like 
Surface Density - Mass Relationship 



S 

= aM>’ 


Data Set 

a 

5 


BGPS 

43112 

0.15 (0.04) 

-0.96 

RDIO 

29114 

0.16 (0.04) 

-0.98 

BH14 

o[;:+50 

0.15 (0.08) 

-0.99 

Combined 

46l^ 

0.12 (0.02) 

-0.95 

Larson (19811 

202 

-0.053 



^ Parameter correlation coefficient p = 
e’ab / e’ae’b • 


as the form of the [Larson] mass-radius relationship. 
This power-law relationship has been reproduced 
by simulations of iso thermal supersonic turbulence 
(iKritsnk et al.||2Q13D. 

While Equation (15) appears to have theoretical 
backing, it is important to note the effect of ob¬ 
servational systematics on the derived relationship. 
Combining Equations (© and <§ leads to a system¬ 
atic M (X relationship, which has be en advanced 
as the source of Lar son’s 3^*^ Law {e.g 


Ballesteros- 


Paredes et al.||2Q12 and references therein). Owing 


to the similarity between Equation ([^) and the sys¬ 
tematic relationship, we investigate the deviation 
from M (X for both BGPS sources and mea¬ 
surements of GMCs from the ^^CO(l-O) data of the 
GRS. 

To remove the effect of heliocentric distance, we 
compare the mean mass surface density (E) with 
the mass of each object. Eignre pT] plots the 1,369 
BGPS Distance Catalog objects with real Or as 
black circles, where the vertical axis is computed 
via E = . The wide scatter in points is 

likely due in part to the non-uniform geometry of 
objects (as discussed earlier). To evaluate the cor¬ 
respondence of the relationship for the BGPS ob¬ 
jects (Le., clump-scale structures) with larger struc¬ 
tures, we also plot in Eigure[^ results of two stud¬ 
ies o£j£CO__cffiiffis_^;omL_th^^ GMCs studied 
by Roman-Duval et al. (2010| hereafter RDIO) are 
shown as cyan squares, and the clouds c o ntain 


ing BGPS sources from Battisti & Heyer (2014 
hereafter BH14) are depicted with orange triangles. 
While both of these studies used the GRS ^^CO 
data, they used different source identification algo¬ 
rithms, resulting in the differences shown in Eig- 
ure[^ Eor each data set, we fitted a linear function 
in logarithmic space that is robust against outlierj^ 
to the data as logE = log a + 61ogM, and the re¬ 
sulting power-law fit values are listed in Table 
The individual fits for each data set are shown in 
the same color as the data points. Eurthermore, we 
also fit a power law through the entire collection of 
points (red) to span a further order of magnitude in 
each dimension than any one data set alone. 


Rather than a “least squares” fit, we utilized a “least 
absolute deviation” met hod to minimize t he effects of outliers 
on the resulting fit (see [Press et al.|2Q07| §15.7.3). 


Casting Equation (15) in the same form as the 
data of Eigure pT] yiel^ 


pc 


-2 


VA4 


-0.053 


(16) 


indicating a nearly-flat mean mass surface density 
as a function of mass, and is shown as the dot- 
dashed gray line in Eigure IE The slope is so shal¬ 
low as to cause only a factor of two difference in E 
over some six orders of magnitude of mass. 

Strikingly, the data from both continuum (BGPS) 
and ^^CO (RDIO, BH14) observations consistently 
show a trend of increasing E with object mass, with 
the fits to the three individual sets returning nearly 
identical power-law indices. The similarity of these 
fit values indicates no significant difference between 
physical properties computed from dust continuum 
and molecular transition line observations of dense 
molecular cloud structures. These fit values sig- 
nificantly differ from the valu e in Eq uation ( |16| ). 
Although contrasting with the [Larsonj slope, these 
fits still represent less than an order of magnitude 
change in E over six orders of magnitude in mass. 

Caution should be taken when drawing conclu¬ 
sions from this result. While the intrinsic scatter 
within each data set is large, none of th e data s ets 
could be consistent with the predicted Larsonj re¬ 
lationship (6 = —0.053) within the fit uncertain¬ 
ties. However, both the Larson relationship and 
the fits shown in TableJ^ deviate only slightly from 
the systematic M ^ R. The fact that the proper¬ 
ties of dense molecular cloud structures from three 
separate studies all reveal nearly the same, slight 
departure from the systematic 5 = 0 indicates a 
real, though slight, trend in mass surface density 
as a function of mass. In its simplest interpreta¬ 
tion, this relation seems to imply that higher mass 
GMCs are also denser and therefore could possi¬ 
bly form stars more efficiently. This interpreta¬ 
tion must be tempered, however, by the inability of 
current survey data-reduction techniques and cata¬ 
loging algorithms to accurately represent large-scale 
low-surface-brightness structures in the interstellar 
medium. 


5.2.2. The Continuum of Structure 

The continuum of structure within molecular 
cloud complexes is a fundamental property whose 
connection with the intimate processes of star for¬ 
mation requires further study. Much of the litera¬ 
ture discusses objects as clouds, clumps, or cores, 
and assigns specific properties and physical signifi¬ 


cance to each {e.g., McKee fc Ostriker|2QQ7 Bergin 


fc Tafalla||2QQ7| [KeWcutt Evans|[2Ul^ and ref¬ 


erences therein). As seen in Eigure 11 however, 
observed dense molecular cloud structures exhibit a 
continuum of properties across many orders of mag¬ 
nitude in scale; here we discuss the implications of 
this continuum. 

The near uniformity of mean mass surface den¬ 
sity of dense molecular cloud structures provides a 
crucial postulate for the construction of the Galac- 
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tic distribution of dense molecular gas. Whether 
an artifact of observational systematics or a con¬ 
sequence of supersonic turbulence |Hop- 


kins 2012), the observed uniform relationship m 
dicates that the fractal quality of the molecular 
interstellar mediurq^ is perhaps more fundamen¬ 
tal and a stronger driver of molecular cloud com¬ 
plex evolution than the individual classes of objects 
(z.e., clouds, clumps, cores) working independently. 
There exist real physical divisions as the fractal na¬ 
ture of molecular clo uds is driven by the the dom¬ 
inance of turbulence (Kritsuk et al. 2013); regimes 
dominated by other physical processes (z.e., gravi¬ 
tational collapse) obey other scaling relationships 
Aside from questions of the evolution of the ISlW 
what are the implications of these scaling relations 
on observational d ata? As a basic reference point, 
we start from the Larson result of Equation (15) 
and convert the physical^roperties to observaBIe 
quantities via Equations ^ and The resulting 
relationship between observed flux density, decon¬ 
volved angular radius, and heliocentric distance is 
given by 


Jy / \arcmmy \kpc 


- 0.10 


(17) 


This equation implies that for dense molecular cloud 
structures, the observed flux density is nearly inde¬ 
pendent of heliocentric distance, and depends only 
on the angular radius on the sky, which is the same 
as saying the mass surface density is independent 
of mass (as seen in Eigure [TT|. This relationship 
is plotted in Eigure {top) as green lines for = 
1 kpc (solid), 5kpc jotted), and 20kpc (dashed), 
and matches the slope of data quite well. Therefore, 
in maps from (sub-)millimeter continuum surveys, 
the physical scales of detected objects are indistin¬ 
guishable, meaning that a core looks like a clump 
looks like a cloud. It is only through robust distance 
estimation or application of the size-linewidth rela¬ 
tionship that the scales of molecular cloud struc¬ 
tures may be interpreted. This is an important 
point because the use of ancillary data is essentially 
the only means for deriving physically meaningful 
results from these surveys. 

In EB15, it was noted that the surface bright¬ 
ness {SB) distributions for the full BGPS catalog 
and various subsets (including the Distance Cata¬ 
log) were largely independent of cataloged flux den¬ 
sity. Indeed, by combining the surface brightness 
equation from EB15 (their Equation 3) with Equa- 


The interstellar medium, and mol ecular clouds in partic¬ 
ular, display fractal dimensions {e.g.^ |Beech|1987|), wherein 
a self-similarity relationship exists over many orders of mag¬ 
nitude and no preferred scale exists between the limits of 
Galactic disk scale heights and runaway gravitational col¬ 
lapse of molecular cloud cores. 

Following a Jeans collapse scenario, M ~ i7, given uni¬ 
form composition and temperature. 


tion ( [TtI ), we obtain 


SB = 13.3 MJy sr“ 


-1 

V Jy y Vkpc^ 

(18) 

which indicates that the surface brightness of dense 
molecular cloud structures, regardless of size scale 
and helioce ntric distance, should be nearly identi¬ 
cal. Indeed, Solomon et al. (1987) comment that for 
the standard Cy oc size-linewidth relationship, 
virial equilibrium translates into constant average 
surface density, and hence constant average surface 
brightness. The observed surface brightness along 
any line of sight is then proportional to the number 
of structures, or total mass, along that line of sight. 
If one assumed an axisymmetric model and sampled 
a large fraction of the plane, the radial dependence 
(and normalization) of the molecular gas along the 
line of sight could be solved for. 


5.3. Mass Distributions and Mass Functions 
5.3.1. Aggregate Mass Distribution versus Subsets 

The aggregate mass distribution shown in the 
left panels of Eigures and represents an inho- 
mogenous set of dense molecular cloud structures 
across the Galactic plane. The entire collection of 
sources appears to be fitted well by a lognormal 
function over more than two orders of magnitude, 
and displays one or two power laws at the high- 
mass end. Prompted by theoretical predictions for 
a power-la w form of the so-ca l led clump mass func - 
tion (e.^., jDonkov et al.|2012||Veltchev et al.|2013|), 
we made a series of astrophysically motivated cuts 
(see Appendix 0 in an attempt to isolate a more- 
homogenous population of sources. 

While the classifications of molecular cloud ver¬ 
sus clump versus core are intuitively appealing, the 
mass distributions of BGPS sources divided into 
core/clump and cloud groups (Table return func¬ 
tional fits very similar to the aggregate set. Three 
possible explanations exist, likely working in con¬ 
cert: (1) the dividing lines between object classes 
are somewhat arbitrary, (2) uncertainties in the he¬ 
liocentric distances mix objects across classification 
boundaries, and (3) the distinctions themselves are 
artificial in light of the continuum of structure in 
molecular cloud complexes. It is likely that a sur¬ 
vey with improved angular resolution may be able 
to disentangle mass function parameters as a func¬ 
tion of source classification or physical properties. 
Eurt her more, use of multi-band Hi-GAL data will 
obviate the need to assume a dust temperature (or 
to marginalize over the dust temperature, as was 
done with the Monte Garlo trials), eliminating a 
source of uncertainty in derived properties. 

The best power-law fits were for subsets primarily 
limited in heliocentric distance, and are character¬ 
ized by nearly-symmetric [d, Mmin] joint confidence 
intervals (similar to the right panel of Eigure 1^. 
These subsets were the “nearby” heliocentric dis¬ 
tance and “mixed protocluster” cuts, which exclude 
the most distant objects, with the latter addition- 
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ally excluding nearby, low-mass objects. While both 
are strongly power-law the “nearby” subset has a 
steeper power-law index (d = 2.04 ± 0.09 versus 
1.92lQ;Jg), although the two are marginally consis¬ 
tent with each other. 

The lack of a strong power-law fit for the “dis¬ 
tant” sample or the two cloud-scale subsets (in R 
and n) may be partially caused by the apparent 
steepening in the mass distribution at high-mass as 
discussed below. The subsets that do exhibit power- 
law structure, however, indicate that the presence 
of a true underlying power law (if present) is not 
obscured and conve rted to a lognormal by sample 
inhomogeneity (see ^5.3.3). 


5.3.2. Steepness of the High-Mass Power-Law Fit 

One curious aspect of the mass function fits is 
the propensity for the high-mass end to display a 
fairly steep power-law tail {a ^ 2.2 — 2.3). While 
this p ower-law i ndex is similar to the high-mass end 
of the|Salpeter| (|1955|) IMF, objects detected by the 
BGPS in this niass range are generally cloud-scale in 
physical radius. Since Galactic GMGs tend to follow 
a pow er-law mass function with a ~ 1.6 — 1.8 ( |Blitz| 
1993), the present result is somewhat contradictory. 

steep power-law function (z.e., a > 2) im¬ 
plies that most of the total mass is in low-mass 
objects and that high-mass objects are rarer than 
more shallow functions (z.e., there are fewer per 
mass bin; Equation]^. Could the apparent steep¬ 
ening be caused by “missing” high-mass sources 
in the present Distance Catalog? Simple simula¬ 
tions show that removal of sources from a shallow 
power-law distribution does have the effect of im¬ 
itating a steeper power law, given that a progres¬ 
sively larger fraction of sources are removed with 
increasing mass. 

Between the atmospheric subtraction in the 
BGPS data-reduction pipeline removing emission 
on large angular scales (> 300"; G13) and the ten¬ 
dency of the cataloging routine to break up large- 
scale emission into smaller chunks based on inter¬ 
mediate peaks in the flux-density distribution, the 
net effect is to remove the largest scale objects and 
convert single large structures into multiple smaller 
ones. What effect do these have on the derived mass 
distribution? Converting the mass-size scaling rela¬ 
tionship of Equation into observational quan¬ 
tities via Equation yields 


M = 43.8M, 

\arcmm J \kpc J 


(19) 


The effective Or < 2^5 limit of the BGPS means 
that clouds at =5 kpc (near the peak of the he¬ 
liocentric distance distribution) with M > 5000 
should be either truncated or broken into multiple 
smaller sources. At a distance of 10 kpc, the filter¬ 
ing affects sources with M > 2 x 10^ M^. Indeed, 
in the Physics Catalog of Table there are only 7 
objects with above this value. In terms of the 
mass distribution, the breaking up of large struc¬ 
tures into smaller ones adds objects to the low-mass 


end of the mass distribution, steepening the power 
law. The power-law indices presented in Table 
should, therefore, be viewed as upper limits. Accu¬ 
rate measurement of the high-mass power-law dis¬ 
tribution of dense molecular cloud structures, there¬ 
fore, requires retaining some amount of large-scale 
structure in survey images and catalogs. 

5.3.3. The Inevitability of a Lognormal + Power-Law 
Mass Funetion? 

Observationally, the aggregate mass distribution, 
as shown in Eigurej^ {^eft)^ should be expected to 
be well fitted by a lognormal function with a power- 
law tail regardless of the actual underlying distribu¬ 
tion of source mass. Given the finite angular reso¬ 
lution and flux-limited nature of the BGPS (and 
similar surveys), it is possible to simulate the ob¬ 
served mass distribution of an inhomogenous sam¬ 
ple spread across the Galactic plane. 

We conducted a series of Monte Carlo trials, 
whereby we “observed” 100 star-forming regions 
(SERs) whose distances were drawn from the 
distribution of the Distance Catalog. Each SER 
consisted of 10^ objects drawn purely from either 
a power-law (Trial 1) or lognormal (Trial 2) mass 
distribution. A physical radius was assigned using 
Equation ( [T^ to estimate whether any given ob¬ 
ject would oe resolved by the BGPS, and the ex¬ 
tent to which its flux density (computed from the 
mass) would be diluted in the beam. Using an av¬ 
erage BGPS noise level, sources were culled from 
the list that would not be detected {Sy < 3cr). 
The aggregate distribution of “detected” sources 
was recorded, and the process repeated for N = 100 
Monte Carlo realizations. 

The results of Trial 1, where the underlying 
power-law mass distribution was allowed to range 
over 1.8 < a < 2.2 with Mmin = 1 M^are shown in 
the left panel of Eignre \T^ The aggregate shows a 
strong power-law tail with a = 1.9, and a lognormal 
distribution at lower mass, peaking at « 15 . 

While the location of the lognormal peak is well 
below the completeness levels of the BGPS data 
set, these simple simulations do not account for real 
features in the data, such as source confusion and 
hierarchical (fractal) structure, and likely overesti¬ 
mate the number of low-mass sources visible. Trial 2 
(right panel) constructed individual lognormal mass 
distributions using the ranges 4.5 < fi < 6.5 and 
1.5 < (J < 2.5. The resulting aggregate mass distri¬ 
bution shows a steeper {a ~ 2.3) power-law tail at 
high mass, but has a lognormal fit much closer to 
that of the full BGPS set in Eignre These simu¬ 
lations are not meant to definitively show the form 
of the underlying mass distributions, but rather to 
demonstrate the inevitability of a lognormal distri¬ 
bution with a power-law tail at high mass for inho¬ 
mogenous data sets such as the BGPS, regardless 
of the underlying source distribution. The inho¬ 
mogeneity should be reduced in the astrophysically 
motivated subsets (Appendix]^, hence we do not 
necessarily expect them to approach lognormal dis¬ 
tributions with muted high-mass power laws. 
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Figure 12. Simulated aggregate mass distributions of star-forming regions (SRFs) at a variety of distances. Only sources in 
those regions that would be detected by the BGPS (ie., S > 3cr) are included. Left: Composite of 100 SFRs, each with a 
power-law mass distribution with index 1.8 < o; < 2.2. The error bars represent the spread among 100 Monte Carlo realizations. 
The green line represents the power-law fit to the high-mass end of the distribution, and the red curve is the lognormal fit to 
the peak. Vertical dot-dashed lines represent the 50% (M = 70 Mq) and 90% (M = 400 Mq) mass completeness levels, and 
the vertical dashed red line marks the of the lognormal fit. Right: Composite of 100 SFRs, each with a lognormal mass 
distribution with mean 4.5 < /i < 6.5 and width 1.5 < cr < 2.5. Other features as at left. 


5.3.4. Comparisons with the Literature 

There has been much discussion in the literature 
about dense molecular c ores and their relat ionship 
with the stellar IMF (see|Shirley et al.|2003|), but it 
is only with the recent advent ot large-scale Galactic 
plane surveys that large, statistically robust samples 
of clump-scale objects have become available. For 
comparison with the present results, we discuss the 
recent studies of clump-scale objects by |01mi et al.| 
(2013) from the science dem onstration helds from 
the He rschel Hi-GAL s urvey (jMolinari et al.|2010b) 
and by Wienen et al. ([2M5|) from the A = 870 /iin 


Turning first to the 


ATLASGAL survey (pc hnller et al. 
,h( 


[ 2 00 ^. 

(|2013p analy- 


'Ulmi et al 

SIS, direct comparison with the results is 

not fully straightforward, as the source-finding al¬ 
gorithm used by those auth ors is based on a Mex- 
ican Hat Wavelet (e.^., Barnard et al.| [2004^ 


and typically finds structures on scales smaller 
than those identified in the BGPS catalog (which 
used a seeded-watershed algorithm). Furthermore, 
Hi-GAL sources were identified as well-separated 
sources visible in each of the A = 70 /im, 160 /im, 
and 250 /im bands; the beam size at 250 /im is 
17", approximately half the width of the BGPS 33" 
beam. These two features imply that the |01mi et al. 
data set will find more, smaller sources than the 
BGPS. 

Gomparing source lists in the I = 30° field, the 
BGPS V2 c atalog identifies 492 sources, whereas 
|01mi et "dl] report identifying 1,950 sources n the 
same longitude range. Although the Hi-GAL lat¬ 
itude range extends twice as far as the nominal 
BGPS coverage, the bulk of the Herschel-measwced 


emission lies within the boundaries of the BGPS 
mosaics. For the i = 59° field, the differences are 
much more astou nding; the BG PS catalog contains 
17 sources while |01mi et al.| find 3,402. A vi¬ 
sual comparison of the data sets reveals that the 
Hi-GAL tile contains significant filamentary struc¬ 
ture at modest flux density whereas the BGPS mo¬ 
saic exhibits nearly uniform noise, indicating that 
atmospheric removal has obliterated much of this 
low-level signal. The vast majority of the Hi-GAL 
sources in this tile l ie along these filaments (see 
Molinari et al.||2010a their Figure 3). Gareful com- 

"D "D -v-x r~i 'Ll y-xTT -ry-vT ry-v-v* r~i "Li y-x-r-r •rr~i 


parison of Hi-GAL and BGPS maps, however, shows 
that much of what appears to be atmospheric 1/f 
noise actually corresponds to the cirrus-like filamen¬ 
tary structure made choppy by the filtering. 

With these differences between the two data sets 
in mind, we compare the po wer-law fits to the m ass 
distribution of sources. Since Olmi et al. (2013) use 
a uniform distance for each of the two fields = 30° 
is assigned a distance of 7.6 kpc and i = 59° is as¬ 
signed to 3.6kpc), it is possible to compare their 
mass function fits to our subsets in heliocentric dis¬ 
tance. Taking first the £ = 30° field, the large dis¬ 
tance assigned to the Hi-GAL sources corresponds 
to the set of distant BGPS objects {d^ > 6.5 kpc; 
see Appendix A.l). While our sample is not fit¬ 


ted well by a power-law, the maximum-likelihood 
parameters are d = 1.93lo!o5 M > 2000 . 

This is in comparison to the steeper [Olnii et al.| fit 
values (d, Mmin) = (2.15, 212M^). The lognor¬ 
mal fit presented by those authors has a peak near 
= 100 Mq, somewhat smaller than the 
fit to the entire BGPS set (Table [^, following the 
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trend of their power-law fit. 

The £ = 59° field is placed at a distance corre¬ 
sponding to our nearby sample in heliocentric dis¬ 
tance < 6.5 kpc). The nearby sample is fit¬ 

ted well by a power-law with a = 2 . 01 ^o!o 9 
M > 620 M^. As with the i = 30° field, Olmi 


et al. (2013) find a steeper power-law that extends 


to lower mass (d, Mmin) = (2.20, 7.3 M^). Be¬ 
cause this power-law fit extends to such low mass, 
it is tempting to conclude that this Hi-GAL tile is 
imaging core-scale rather than clump-scale objects. 
Since sources must be identified in the A = 250 /im 
images whose angular resolution is ~ 25", however, 
this corresponds to a minimum observable size of 
^ 0.4 pc at the ass igned 3.6 kpc d i stanc e. 

The more recent Wienen et al. (2015) analysis of 
clump-scale dense molecular cloud structures in the 
ATLAS GAL data provides a more direct compari- 
son set with the BGPS Distance Gatalog. While the 
Olmi et ah] results derive from two specific regions, 
ATLAS GAL sources span the entire inner Galac¬ 
tic plane (—60° < £ < 60°). As such, it suffers 
from the same spread in heliocentric distance (and 
hence resolving power) as the BGPS and sweeps 
up objects across the spectrum of molecular cloud 
structures. We compare the BGPS Distance Gata¬ 
log distribution with the ATLASGAL sources from 
the first Gal actic quadrant , a lar gely overlapping 


population. Wienen et al. (2015) note that only 


a 70% distance matching rate (within 2 kpc) exists 
between their distance assignments and those from 
EB15. This relatively large disagreement seems to 
arise primarily from the fraction of objects placed 
at the far kinematic distance (44% of ATLASGAL 
sources versus 22% of BGPS sources). Those au¬ 
thors dismiss the significance of the distance mis¬ 
match by claiming that distance errors average out 
by virtue of the large ATLASGAL sample size. 
However, if the population of true distances is sim¬ 
ilar for the two surveys, the larger fraction of AT¬ 
LASGAL sources placed at the far kinematic dis¬ 
tance would systematically shift the mass distri¬ 
bution and affect the mass function fits. While a 
detailed distance catalog comparison is beyond the 
scope of this paper, the significant overlap between 
the ATLASGAL and BGPS survey data merits a 
careful comparison of source identification schemes, 
derived distances, and implications for mass distri¬ 
butions. 

Notwithstanding the discrepancies between the 
two distance catalogs, differences in source prop¬ 
erties arise from the angular transfer function of 
the ATLASGAL data, which ranges from the beam- 
scale 1972 to a maxim um recoverable scale of ~ 150" 
(jSchuller et al.|2009|. This beam size is « 60% that 
of the BGPS, meaning that smaller-scale structure 
is resolved in the ATLASGAL mosaics. The corre¬ 
sponding physical size at the 4 kpc heliocentric dis¬ 
tance peak in the source distribution is 0.4 pc, 
meaning that the sour ce catalog is still domin ated 
by clump-scale objects (Bergin & Tafalla|2007). On 
the large-scale end, the A'l'LASGAL data reduc- 


tion pipeline effectively removes uniform emission 
on scales larger than ^ 2^5, whereas the BGPS an¬ 
gular transfer function is > 0.8 for scales smaller 
than ^ 5' (G13), leading to the recovery of larger 
objects. 

The above-mentioned differences may play a role 
in the com paris on between the mass fu nction fits de¬ 
scribe d in §4.2| and those presented by [Wienen et al.| 
(2015). While those authors experimented with ht- 


tmg power laws to binned data, we compare only 
the results obtained using the maximum-likelihood 
method of Glauset et al. (2009). A power-law func¬ 
tion fit to the entire A'ITASGAL source collection 
yields a = 1.82 ± 0.02, somewhat flatter than our 
value of d = 1.941 q ^q. The source of the difference 
is unclear, unless there exist sources of significant 
mass in the ATLASGAL catalog within the i?gai = 
4 kpc kinematic avoidance zone (EB15) for which 
kinematic distances are not computed for BGPS 
sources. Eurt her more, the very small error bars 
reported by those authors conflict with the wider 
spread in power-law index resulting from the Monte 
Garlo realizations of the BGPS mass distribution 
shown in Eigure and likely do not take into ac¬ 
count the effects of distance uncertainty. T he only 
subset of the data to which I Wienen et al.l fit mass 
functions is the cut 2 kpc < < 5 kpc, encompass¬ 

ing the peak of the heliocentric distance distribu¬ 
tion. Eor this subset, they find a power-law index 
of d = 2.26±0.05, somewhat steeper than either our 
set of nearby sources (dQ< 6.5 kpc; d = 2.0 ± 0.1) 
or the mixed protocluster criterion (2 kpc < dQ< 
10 kpc, M > 300 M^; d = 1.9 ±0.1). One possible 
explanation for the steeper ATLASGAL fits is the 
filtering of large-scale emission into sm aller s ources 
due to the an gular transfer functio n (^5.3.2). Also 
included in the Wienen et al. (2015) analysis is a log¬ 
normal fit to this distance subset, yielding a peak 
near 700 M , considerably larger than the lognor¬ 


mal fits to BGPS Distance Gatalog sources (^4.2.3). 

Eor both these comparison studies, a better un¬ 
derstanding of the discrepancies between the mass 
function fits will come when all data sets are ana¬ 
lyzed with the same cataloging method to remove 
systematic effects introduced by source identifica¬ 
tion. Eurthermore, a more detailed study of the 
filtering effects of data pipeline angular transfer 
functions on source identification within large-scale 
structure may provide a more solid basis of compar¬ 
ison between results from different (sub-)millimeter 
Galactic plane surveys. 


5.4. Estimating the Dense Gas Mass Fraetion 

The Galactic mass distribution in Eigure [^rep¬ 
resents the surface density of the dense molS^ular 
cloud structures observed by the BGPS across the 
Galactic plane. Although the mass in that figure is 
diluted into 0.25 kpc x 0.25 kpc pixels, it still rep¬ 
resents a large-scale measurement of the dense gas 
mass on scales approximating the spatial resolution 
of e xternal galaxies in the H ersehel KINGEISH sur¬ 
vey (Kennicutt et al. ]MI ). Placing the Milky Way 
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Figure 13. Two estimates of the dense gas mass fraction 
estimate as a function of Galactocentric position. In both 
panels, contours indicate linear increments of 0.5%, and the 
stars mark the locations of the Galactic center (iCgahygal) = 
(0, 0) kpc, and the Sun (iCgabygal) = (0,—8.34) kpc. (a) 
The face-on map was computed by dividing the mass sur¬ 
face dens ity of Figure by the azimuthally averaged H 2 
model of|Wolfire et al.|^003|). (b) Same, except the face- 
on mass surface density map used consists only of sources 
with n > 10^ cm“^. 


in the context of fully visible galactic disks is an im¬ 
portant step toward linking locally observable small- 
scale processes with the global pictures gleaned from 
external galaxies. In addition to studying the mass 
surface density of dense molecular cloud structures, 
we can ask what the dense gas mass fraction is. 

Because our estimate of the dense gas mass sur¬ 
face density is measured on kiloparsec scales, it is 
important to select a model or measurement of the 
total molecular gas content with similar resolution. 
For this purpose, we utilized the a zimuthally av¬ 


eraged H 2 distribution modeled by IWolfire et al. 
(|2QQ3 ) from the data presented in Bronfman 


et al. (1988). This model does not take into ac¬ 


count azimuthal asymmetries such as spiral arms or 
the Galactic bar, but it provides a baseline value to 
estimate the fraction of the surface density in the 
Galactic disk measured by the BGPS Distance Gat- 
alog. 

Two estimates of the dense gas mass fraction map 
are shown in Figure 13 The first (panel a) was com¬ 
puted by dir ectly di\dd^ing the surf ace density in Fig¬ 
ure!^ by the IWolfire et al.| (|2003|) model. Since the 
BGK is sensitive to large-scale diffuse objects at 
large heliocentric distance (see Figure!^, this first 
estimate includes many objects whose oiilk density 


is not significantly enhanced over the base Wolfire 


et al.| model. The second estimate of the dense gas 


mass fraction (panel b) was computed in the same 
way as the first estimate, but includes only objects 
with n > 10^cm“^. While this value is not gener¬ 
ally considered “dense” gas, it represents a view of 


the distribution of the denser molecular cloud struc¬ 
tures detected by the BGPS. 

The vast majority of pixels in the maps have <1% 
dense gas mass fraction, but some have > 5% (note 
that the color scale saturates at ~ 2.1%). Recall 
that the surface mass density in Figure [^represents 
the mean of BGPS sources over 0.25 kpc x 0.25 kpc 
pixels, whereas the typical size of GMGs is ^ 10 pc. 
Therefore, the dense gas mass fraction shown in Fig¬ 
ure [^should be considered as a lower limit, aver¬ 
aging over a large area, especially in light of the 
total mass shown in the left panel coming from only 
20% of the BGPS catalog (by number, but 33% by 
flux density). This point should be further empha¬ 
sized for the right panel, with the number density 
threshold required for inclusion. 

For context, two recent studies have analyzed 
the dense gas fraction of molecular cloud st ructures 
hosting BGPS dust-continuum sources. |Battisti| 
& Heyer (2014) compared the masses from BGPS 
sources with the masses of the parent GRS GMGs, 
and found a mean value of O.lllo.oe- study did 
not apply a specific number density threshold, but 
rather investigated the fraction of the ^^GO GMG 
mass visible in dust continu um emission. In con¬ 
trast, Ginsburg et al. (2015) studied the high-mass 
W51 star forming complex, specifically analyzing 
the dense gas fraction. Those authors found that 
^ 10 — 15% of the W51 GMG mass consists of gas 
with n > 10^ cm“^. Given the caveats about the na¬ 
ture of Figure the present measurement is con¬ 
sistent with bom of these values. 

In interpreting the dense gas fraction maps in Fig¬ 
ure two issues bear discu ssing. The first is the 
“broM-brush” nature of the Wolfire et al.[ 
model of H 2 gas. Not only are there sizac 
certainties in both disentangling of (optically thick) 
^^GO emission into the near and far kinematic dis¬ 
tances and the variable nature of the Xco conver¬ 
sion factor as a function of environment (jBolatto 
et al.||2013|), but enhancements in the surface den- 
sity of H 2 in spiral arm features (coincident with the 
measured position of BGPS sources) would tend to 
decrease the measured dense gas mass fraction. 

The second issue is the finite resolution of cur¬ 
rent (sub-)millimeter continuum surveys and the 
nature of detected objects as a function of helio¬ 
centric distance. Beyond ^ 10 kpc, the major¬ 
ity of detected objects are cloud-scale with lower 
mean density (see Figure j^. What is the thresh¬ 
old for objects to be included in the dense gas mass 
fraction? One illustration of this dilemma is the 
previously discussed concentration of dense gas at 
(%ai,%ai) ~ (-4, 3) kpc in Figure a). While 


this may correspond to gas in the Sa^tarius arm, 
is the BGPS detecting just the dense portion, or 
whole clouds {d^ ~ 12kpc)? Despite these caveats, 
however, this type of measurement offers a key in¬ 
sight into the structure of the Milky Way within the 
context of galaxies in the local universe. 


6. SUMMARY 
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A major payoff of large-scale blind continuum sur¬ 
veys of the Galactic plane at (sub-)millimeter wave¬ 
lengths is deriving the physical properties of regions 
hosting high-mass star formation. A detailed cen¬ 
sus of these dense molecular cloud structures can 
help constrain star-formation and galactic-evolution 
theories. In this work, we used the DPDF distance- 
estimation formalism of EB15 to compute the phys¬ 
ical properties for the set of 1,710 BGPS V2 sources 
in the Distance Gatalog. The significant aspects of 
this paper are: 


1. We utilized the full information available in 
the Bayesian DPDF framework introduced in 


Ellsworth-Bowers et al. (2013) and EB15 to 
compute physical property probability den¬ 
sity functions. Although the DPDFs them¬ 
selves are computed directly on a linear dis¬ 
tance scale, the application of the DPDF, in 
concert with probability density functions for 
source flux density with uncertainty and an 
estimate of the temperature distribution of 
sources, to the computation of physical prop¬ 
erties requires Monte Carlo methods. We 
present a catalog of the masses, physical radii 
and number densities for the objects in the 
BGPS Distance Catalog, with uncertainties 
derived from the individual probability den¬ 
sity functions. 


2. Unlike studies of isolated regions at a uniform 

distance, where flux-density completeness lim¬ 
its map directly onto mass completeness lim¬ 
its, large-scale blind dust-continuum surveys 
of the Galactic plane suffer from Malmquist 
bias and care must be taken to estimate the 
mass completeness levels for these heteroge¬ 
nous samples. We demonstrate a means for 
estimating the mass completeness function us¬ 
ing the rms noise at the locations of sources 
in the survey. We estimate that the BGPS 
Distance Catalog is 90% complete for M > 
400 Mq and 50% complete for M > 75 . 

3. To understand the role of clump-scale fea¬ 

tures in molecular cloud complexes, we com¬ 
pute both power-law and lognormal fits to 
the mass distribution. Fits to the entire 
collection of sources yield a power-law in¬ 
dex a = 1.941q‘^q for objects with masses 
10^ Mq ^ M < 10^ Mq , and a lognormal 
(/i, a) = (5.3, 2.0), where the peak of the dis¬ 
tribution is at = 200 . We also 

fit both functional forms to a variety of as- 
trophysically motivated subsets in an attempt 
to identify a collection of BGPS sources that 
represent a “clump mass function” or “proto¬ 
cluster mass function”. The subset of nearby 
sources {d^ <6.5 kpc) is fitted well by a power- 

law with index a = 2 . 01^0 09 ? 
of clump-mass objects at moderate distances 
(M > 300 Mq and 2kpc < d^< 10kpc) is 
fitted well by a power-law with index a = 


For nearly all subsets, however, 
we find power-law indices 1.85 < a < 2.05. 
The exception to this rule is the set of sources 
at M > 2000 Mq, which has a ^ 2.2; this 
steeper than expected power-law index may 
be caused by observation biases in the BGPS 
against large angular-extent sources. Lognor¬ 
mal fits to the subsets reveal that the less- 
homogenous collections of sources are better 
fit by this form. Collections of sources at 
similar distances show stronger power-law be¬ 
havior. Furthermore, simulations of sources 
at a variety of heliocentric distances reveal 
that these inhomogenous samples produce an 
observed lognormal mass distribution with a 
power-law tail regardless of the underlying 
mass distributions. 


4. Combining the heliocentric distances from the 
DPDFs with the masses derived in this work, 
we compute the distribution of dense molecu¬ 
lar gas in the Galactic plane. Both the face- 
on and azimuthally integrated i?gai—^ projec¬ 
tions of the Galactic mass distributions reveal 
the prominent high-mass star-forming regions 
W43, W51, and W49 as significant concen¬ 
trations of mass. The mass-weighted vertical 
mass distribution has a scale height (~ 30 pc) 
consistent with the number-count distribution 
from EB15, implying that mass is uniformly 
distributed among sources regardless of ver¬ 
tical position. Dividing the disk’s mass sur¬ 
face density (face-on view) by a smooth model 
for Galactic H 2 distribution yields an esti¬ 
mate of the dense gas mass fraction, which 
peaks around 5% for the highest mass com¬ 
plexes. This is cons istent with the measured 
11% value from the Battisti & Heyer (2014) 
study of GRS clouds, given that BGPb source 
mass spreads into pixels 250 pc on a side com¬ 
mensurate with the typical uncertainty in the 
DPDFs), whereas typical sizes of GRS clouds 
are < 10 pc. Our measurement is a lower limit 
due to the inclusion of only sources with well- 
constrained distances (20% of the BGPS V2 
catalog by number, 33% by flux density). 


The physical properties of BGPS-detected dense 
molecular cloud structures span the range from core 
to cloud, and illustrate the fractal na ture of molecu- 
lar cloud complexes, embodied in the Larson (1981) 


scaling relationships. While there are theoretical 
motivations for these designations, such as gravita¬ 
tional collapse for protoclusters (clumps) or individ¬ 
ual stellar systems (cores), the systematic observa¬ 
tional effects in current surveys makes it difficult to 
accurately categorize individual objects, especially 
since physical processes, such as gravitational bind¬ 
ing, are environment specific. What data such as 
those presented here can do, however, is begin to 
provide observational constraints on the evolution 
of molecular cloud complexes and the context for 
high-mass star formation. 
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Figure 14. Physical property distributions for BGPS Distance Catalog sources used for division into astrophysically meaningful 
subsets. Top Row: From left to right are shown the heliocentric distance, Galactocentric radius, and physical radius distributions 
of objects. Bottom Row: From left to right are shown the number density, mean mass surface density, and peak mass surface 
density distributions of objects. In each panel the vertical dashed line(s) indicates the dividing line between subsets, with each 
labeled at top, as listed in Table |2| With the exception of the and i^gal panels, only sources with real Or are shown (see 
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APPENDIX 

A. MASS FUNCTIONS BASED ON 
ASTROPHYSICAL CUTS 

BGPS Distance Catalog objects have wide- 
ranging distributions of heliocentric distance, 
Galactocentric radius, physical radius, number den¬ 
sity, and mass surface density. These wide distri¬ 
butions suggest that the present sample is a rather 
heterogenous set of objects; conclusions drawn from 
their aggregate mass distribution may be affected by 
global changes in physical conditions and properties 
and not necessarily representative of any one type 
of physical object. Indeed, we know the BGPS gen¬ 
erally identifies cores to a distance of 1 kpc, clumps 
from 1 kpc to 7kpc, and clouds beyond 7kpc (Dll). 


As such, it is instructive to divide the total collec¬ 
tion of sources into astrophysically meaningful sub¬ 
sets and analyze the resulting mass function fits in¬ 
dividually. 

The physical quantities chosen to discriminate ob¬ 
jects for further analysis are shown in Figure \T^ 
Each panel of the figure shows the distribution 
of sources with dashed vertical lines marking the 
boundaries between the subsets. The text in each 
panel identifies the physically significant subset that 
corresponds to the entries in Table Note that 
in the physical radius and number density panels, 
there is a third region for molecular cloud cores, but 
they are considered with the clump-scale objects to 
avoid biases from small-number statistics. In addi¬ 
tion, we examined the effect of an object possessing 
a dense gas versus ^^CO primary assignment 
(see EB15) and attempted several heterogenou s cut s 
in search of a “protocluster” mass function (^A.6|). 
The implications of these subs ets and their tunc- 
tional fits are discussed in §5.3| 

The functional fits to the mass functions for the 
following subsets were computed in the same man¬ 
ner as for the entire Distance Catalog, as dis¬ 
cussed in §4.2| For compactness. Figures 
show the mass distributions and fit parameter joint- 
confidence intervals in three panels horizontally, 
with subsets of the same criteria shown as rows. The 
left column shows the mean of 2 x 10^ Monte Carlo 
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Figure 15. Mass functions of subsets in heliocentric distance. Top row illustrates the nearby subset (d^ < 6.5 kpc; lines in 
blue) and the bottom row more distant objects (lines in red). Left: The mass distribution shown as the CCDF (gray circles) 
and differential mass distribution (gray histogram). The power-law fits are shown as solid light-colored (CCDF) and dashed 
black (histogram) lines, and the lognormal fits as long-dashed (CCDF) and solid (histogram) dark-colored lines. Middle: Joint 
confidence intervals for the power-law fit, as in Figure Right: Joint confidence intervals for the lognormal fit, as in Figure 


mass distributio ns for the given subset, computed 
as described in §4.2. 1| The middle column displays 
the [d, Mmin] power-lW parameter joint-confidence 
intervals, as in the right panel of Figurewhile the 
right column plots the [/i, d] and [Mmin,^max] log¬ 
normal parameter joint confidence intervals (as in 
Figure!^. For both the middle and right columns, 
the rows have the same plotting ranges for easy vi¬ 
sualization of the differences between the fits to the 
subsets. 


A.l. Heliocentric Distance 

The heliocentric distance histogram for Distance 
Catalog objects (Figure [l4{ top left) exhibits a min¬ 
imum at ~ 6.5 kpc. Beam-scale BGPS objects 
at this distance have R ^ 1 pc, indicating that most 
of the sources beyond this point are likely GMC- 
scale. Dividing the sample here creates a “nearby” 
group containing 78% of the objects, with the re¬ 
maining 22% in the “distant” group. The mass dis¬ 
tributions and functional fit parameters are shown 
in Figure pis) 

The neaxlBy sample (top row) is not polluted with 
more distant objects, and the distribution strongly 
resembles a power law with a = 2 . 0 llo!o 9 
M > 620 Mq , as evidenced by the quasi-symmetric 
joint confidence intervals in the middle panel. The 
lognormal fit (right panel) shows a strong prefer¬ 
ence for values near the joint maximum-likelihood 
(/i, d) = (4.85, 1.72). The low-probability tail to¬ 
wards the top-left corner indicates the need for a 
wider gaussian to fit the data if the mean is moved 
to lower mass. 

In contrast, the distant sample (bottom row) does 
not exhibit a strong power-law shape, as the mid¬ 
dle panel suggests a continuum of [d, Mmin] across 
the 20,000 Monte Carlo trials, with a marginalized 
maximum-likelihood power law index a = l.OS^oios* 


The small sample size of the distant group also 
means that each independent realization of the mass 
distribution will demonstrate more variability in the 
fit parameters than for a large-sample. Despite 
the volatility of the power-law fits, however, the log¬ 
normal fits have a tightly constrained mean around 
« 1200 Mq. The significant difference 
between the jd for the nearby and distant subsets 
clearly illustrates the systematic M ^ d? . 


A.2. Galactocentric Radius 
Galactic environment can influence the proper¬ 


ties and e volution of star-forming regions (Kenni- 
cutt 1998), and we use Galactocentric radius as a 


proxy tor Galactic environment to study the differ¬ 
ences in the mass distribution from the Molecular 
Ring out to tow ard the truncation o f the stellar disk 
(Rgai ~ 15 kpc; Ruphy et al. 1996). We note that 


disambiguating the effects of heliocentric distance 
from Galactocentric radius is somewhat difficult due 
to the Sun’s location within the Galactic disk (z.e., 
objects with small Rgai have larger d^; see Figure]^ 
for the effects on the physical properties). The his¬ 
togram of source Rgai (Figure!^ top center) shows 
several concentrations of sources for further investi¬ 
gation. The molecular ring / Scutum-Gentarus Arm 
feature contains objects at Rgai < 5.5kpc (50%), 
sources at 5.5 kpc < Rgai < 8.5 kpc are likely part 
of either the Sagittarius or Local Arms (36%), and 
anything beyond the solar circle (Rgai> 8.5kpc) is 
part of either the Perseus or Outer Arms (14%). 
This assignment is not one-to-one, as seen in the 
object-location face-on view of the Milky Way from 
EB15 (their Figure 15) where some objects appar¬ 
ently identified with the Perseus arm on the far side 
of the tangent point near 30° ^ ^ ^ 50° lie within 
the middle Rgai group. For the purposes of this 
analysis, distance from the Galactic center is the 
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Figure 16. Mass functions of subsets in Galactocentric radius. Top row illustrates objects in the Scutum-Centarus arm 
(-f^gal ^ 5.5kpc; lines in blue), and the second row objects in the Sagittarius and Local arms (5.5kpc < Rga\ < 8.5kpc; lines 
in red). The third row are the subset of sources from the second row at heliocentric distances greater than 2.8 kpc to remove 
contamination from local sources, meant to illustrate the Sagittari us a rm only (lines in pink). The bottom row objects in the 
Perseus and Outer arms (lines in green). Columns are as in Figure 


factor of interest; arm names are simply used as de¬ 
scriptive identifiers. 

The individual mass distributions and functional 
fits to each subset are shown in Figure Ob¬ 
jects in the Scutum-Centarus arm (top ro^ illus¬ 
trate a somewhat-constrained power-law fit with 
a = 1.98lo!ii foi* ^ ^ although the same 

high-mass steepening at M > 10^ found for the 
complete sample is suggested in the CCDF (gray 
circles). The lognormal fit for this subset is charac¬ 
terized by (/i, a) = (5.4, 1.7), with a tail extending 
towards lower /i, higher a, as is a general feature 
of these fits. Neither functional form yields tight 
parameter constraints in this case, suggesting the 
existence of substantial remaining inhomogeneities. 

The second row of Figure illustrates inter¬ 
mediate i?gai sources, mostly associated with the 
Sagittarius and Local arms. The wide spread in 
the mass distribution is a consequence of the large 


range of corresponding to this Galactocentric 
annulus. Although the maximum-likelihood power- 
law parameter values somewhat favor the high-mass 
end of the distribution (d = 2.02 ^^'for M > 
1600 Mq ), the middle panel demonstrates a long tail 
of Monte Carlo trials to low-a. While this spread 
in the power-law fit parameters seems to indicate 
a continuously curved mass distribution, the right 
panel shows no particular localization of the log¬ 
normal parameters. The maximum-likelihood pa¬ 
rameters, (/i, d) = (4.95,2.74), reasonably fits the 
wide plateau of the distribution, but the extremely 
heterogenous nature of this sample suggest further 
cuts are necessary. As a test, objects likely to be in 
the Local Arm < 2.8 kpc, see EB15) were ex¬ 
cluded from the fits shown in the third row of Fig¬ 
ure!^ While the power-law fit is marginally more 
constrained (d = 1-97^0 24 ^ ^ IbOOMQ), the 

lognormal fit is indistinguishable from the entire in- 
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termediate i?gai subset. 

Objects in the Perseus and Outer arms beyond 
the Solar circle are depicted in the bottom row of 
Figure where the small size of this subset {N = 
234) compared to the others leads to the sharp trun¬ 
cation in the power-law parameters around Mmin = 
400 Mq, with a maximum-likelihood a = 1.94tQ;5o- 
The lognormal fits (right panel) are constrained ap¬ 
proximately as well as for the complete sample, but 
with a slightly larger mean. This subset is generally 
drawn from the outer Galaxy targeted fields in the 
BGPS, and likely is not completely representative 
of the outermost regions of the Galactic disk. 

A.3. Clumps vs. Clouds 

Being shallower functions heliocentric distance, 
the derived physical radii and number densities of 
BGPS-detected dense molecular cloud structures 


can possibly divide the catalog into subsets more 
closely matched to physical distinctions in the in¬ 
terstellar medium (Figure 14, top right and bottom 
left). There are relatively objects falling in the 
“core” category {R < 0.125pc or n > 10^cm“^), 
and we combine these with the “clump” subset here. 
We therefore divide sources into “core/clump” (59% 
of the physical radius distribution and 73% of the 
number density distribution) and “cloud” subsets. 
The lack of a one-to-one mapping between these two 
physical tracers is likely due to variations in source 
geometry away from the cylindrical model used to 
compute n. Furthermore, there are likely obser¬ 
vational biases present that skew this relationship, 
including the blending of substructure into larger- 
radius co mplexes when convolv ed with the telescope 
beam (see Merello et al.||2015). 

Functional fits to the mass distributions are 
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Figure 19. Mass functions of subsets in mass surface density. The top two rows depict objects divided by mean surface 
density, either below Em = 120 M© pc“^ (top row, blue lines) or above (second row, red lines). The bottom two rows show 
sources divided by peak surface den sity, either below Sp = 120 Mq pc“^ (third row, green lines) or above (bottom row, pink 
lines). Columns are as in Figure 


shown in F ignres [Tt] (physical radius), and [^(num¬ 
ber density). Given the similarities between the 
mass distributions for the two methods of classifying 
dense molecular cloud structures, we discuss them 
together. The clump-scale objects are shown in the 
top rows of each figure. For both physical property 
criteria, the power-law fits stretch over wide range 
of parameter space with indices that span a range 
^ ^ ^ 2.2, although both have a maximum- 
likelihood value near a = 1.9. The lognormal fits 
also return similar values for the two clump crite¬ 
ria, with peaks at slightly lower mass and widths 
slightly larger than the lognormal fit to the com¬ 
plete data set. 


The two cloud criteria (bottom row of Figures 17 
and 18) also show great similarity in their functional 
fits.^The primary difference in the power-law fits 
comes from the smaller sample size for number den¬ 
sity group (A^ = 376 for clouds with n < 750 cm“^ 
versus N = 562 for the clouds with R > 1.25 pc) 


leading to the truncation of the joint confidence in¬ 
tervals at large Mmin in the bottom middle panel 
of Figure The lognormal fits are very similar, 
both to each other and to the clump fits in the top 
rows of these figures. These sets of fits suggest that 
dividing the dense molecular cloud structures into 
categories based on physical radius or number den¬ 
sity alone do not provide homogenous or distinct 
subsets for which meaningful mass functions may 
be fit that help constrain the theories of molecular 
cloud evolution. 


A.4. Mass Surface Density 

Investigation of Larson’s 3^^ Law in terms of de¬ 
viations away from a systematic M ^ curve 
showed that while the mean mass surface density 
of Distance Catalog objects spans less than two or¬ 
ders of magnitude around F = 100 Mq pc“^, there 
appears t o be a slight upward trend as a function 
of mass (^5.2.1 ). In studies of star forming regions, 
the value Ay = 8 mag is often cited as a threshold 
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Figure 20. Mass functions of subsets in velocity type. Top row includes objects whose was derived from spectroscopic 

observation of a dense gas tracer {e.g., HCO^ (3-2); lines in blue), and the bottom row are objec ts w hose kinematic information 
comes only from the isolation technique of EB15 (lines in red). Columns are as in Figure [l^ 


tracers HCO (3-2), NH .sfl,!)) in conce r t with 


a Galactic rotation curve. Shirley et ah (2013) 


for ongoing star fo rmation due t o self-shielding from 
ionizing radiation (McKee|1989). Depending on the 
dust physics assumptions used, this translates into 
a mass surface d ensity of X = 120 — 160 Mq pc“^ 

(Lada et al. 2013). The histogram of mean mass sur- 
face density tor Distance Gatalog sources is shown 
in Figure bottom center, and the vertical dashed 
line marks the Ay = 8 mag value. The top axis of 
this panel lists the X values in g cm“^, as this unit is 
discussed in terms of thresholds for star formation. 

In similar fashion, we considered the peak mass sur¬ 
face density, computed from the peak flux density 
reported in the BGPS catalog and the 33" size of 
the BGPS beam, and the resulting source histogram 
is shown in the bottom right panel of Figure [14] with 
similar markings as the mean mass surface dSisity. 

In Figure we show the mass distributions and 
functional nts for the four subsets based on mass 
surface density. The top row depicts the sources 
with mean S below 120 Mq pc“^, and comprises 
some 80% of the Distance catalog. This set is 
not well-fit by a power law (d = 2.38lo'36)5 t)ut 
does have a moderately constrained lognormal fit 
(yh, (j) = (5.1,1.7). The positive correlation be¬ 

tween mass and mean S can be seen in the differ¬ 
ences between the lognormal fits of these two sub¬ 
sets. 

The fits for the subsets divided by peak mass sur¬ 
face density are shown in the bottom two rows of 
Figure In this case, the bulk of the sources 
(78%) have peak S above the 120 Mq pc“^ thresh¬ 
old. The lack of high-mass sources in the third row 
of the figure indicates that there are no large diffuse 
sources whose total mass is large despite having low 
X; this is commensurate with the BGPS angular 
transfer function (G13). 

A.5. Type of Radial Velocity Measurement 

Kinematic distances for BGPS sources rely pri- _ _ 

marily upon 'p^sr measurements from dense gas et al. (2001), meaning they are typically nearby 


showed for BGPS sources that detection in one of 
the dense gas lines is a strong function of millime¬ 
ter flux density. To expand the available catalog of 
objects for which kinematic distances may be com¬ 
puted, FBI5 demonstrated a means for combining 
continuum data with diffuse-gas tracing GO(1-0) 
to extract p^sr agreement with the dense-gas val¬ 
ues at a rate of 95%. This method was applied to 
dense molecular cloud structures not detected in one 
of the dense-gas surveys to expand the number of 
objects in the BGPS Distance Gatalog. The type of 
Plsr assignment, therefore, offers an additional cut 
for investigating mass function fits. 

The top row of Figure shows the mass dis¬ 
tribution and functional fit parameters for the set 
of objects detected in one of the dense gas tracers 
(see EB15 for details). Since this set comprises 76% 
of the Distance Gatalog, it is unsurprising that the 
functional fits are nearly identical to those of the 
entire Gatalog shown in Figures and The set of 
sources identified only with ^^GO(l-0j"^is generally 
lower mass, and has less-constrained fits. 


A.6. Attempts to Find a ^FrotoclusteF' 
Function 


Mass 


In addition to the strict cuts along the physical 
quantities described in Figure additional sub¬ 
sets of the data were constructeT^in an attempt to 
define a “protocluster” mass function. The subsets 


are shown in Figure 21 along with their respective 
functional fits, and are discussed below. 

The first two subsets are the blindly surveyed por¬ 
tion of the BGPS {i < 90°) and all objects with 
physical radius R > 0.2 pc. The outer Galaxy por¬ 
tions of the BGPS were targeted specifically based 
on known regions of star formation or con centra¬ 
tions o f emis sion in the CO(1-0) maps of 
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Figure 21. Mass functions of protocluster-type subsets. Top row only includes objects in the blind portion of the survey 
{i < 90°; lines in blue), the second row objects of probable protocluster physical radius {R > 0.2pc; lines in red), the third 
row objects with a mixed set of criteria (2kpc < < lOkpc and M > 300 ; lines in green), and the bottom row the most 

high-mass objects (M > 2000M^; lines in pink). Columns are as in Figure |15| 


clouds and suffer significant selection bias. Uti¬ 
lizing sources only from the inner Galaxy, where 
survey coverage was not (generally) predicated on 
prior knowledge of the structure of emissionj^ al¬ 
lows for a more unbiased sampling of sources (and 
therefore mass distribution) for fitting a mass func¬ 
tion. The second subset removes all core-scale ob¬ 
jects and those with non-real solutions for Oji (which 
have low fl ux-density and therefore likely low mass, 
see ^4.1.1). The removal of low-mass objects does 
not appreciably affect the high-mass end of the mass 
distribution, and both subsets have power-law fit 
parameters a ^ 1.9, Mmin ~ 900 M^, consistent 
with the fit parameters for the complete set. The 
loci of points in middle panels of Figure ^T\ bear a 


The “generally” arises because of the several regions 
along the inner Galactic plane where the BGPS latitude cov¬ 
erage was flared out from the nominal \b\ < 0?5 to |6| < 1?5 
in areas wit h known large verti cal extent of star-formation 
activity (see [Aguirre et al.|2011| . 


strong resemblance to that in Figure The lognor¬ 
mal fits similarly reflect the entire sample, and do 
not add additional insight. 

The other two subsets shown in Figure be¬ 
gin to more strongly resemble power-law distri¬ 
butions. The third row is the subset of sources 
with 2kpc < < lOkpc and M > 300 

chosen to reflect a mixed criterion in search of 
protoclusters. These conditions remove the most 
nearby objects (likely cores) and most distant ob¬ 
jects (likely full clouds), and restricts considera¬ 
tion to a mass range consisten t with star clus¬ 
ters (Kennicutt & Evans [2012 ). The power-law 
fit joint confidence regions produce a narrow range 
of parameters with maximum-likelihood values at 
a = The power- 

law index is consistent with many of the fits, and 
extends to near the 90% mass completeness level. 
The lognormal fit displays an unusual set of con¬ 
tours, indicating that this functional form is not 
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an appropriate description of the mass distribution. 
Finally, the bottom row of Figure ^T\ illustrates the 
fit to just the high-mass end of the BGPS distribu¬ 
tion (M > 2000 Mq). The power-law fit parameters 
line up with the uppermost collection of points in 


the fit to the full sample, confirming the steep dis¬ 
tribution (d = 2.23lo!o6) these sources. While 



mal fit is tightly constrained, but since the minimum 
mass of this subset is substantially larger than the 
completeness levels, no real turnover in the data is 


expected. 
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